SCIENCE IN INDIA


A special issue explores India’s grand challenges page 141
A nation with ambition


India is making great strides in improving its science, but it needs to look carefully at its approach 
and work with the rest of the world if it is to realize its full potential. 


proudly states that “India is one of the top-ranking countries in 
the field of basic research”. 

It is true that India has made considerable progress in areas such as 
biotechnology, renewable energy and aerospace. But it is also mired 
in deep problems that impede innovation and are hampering the 
country’s progress. India has a puny scientific workforce, relatively 
few high-quality universities, an anaemic manufacturing sector and 
an epidemic of red tape. The result is that many Indian scientists head 
to other countries for training and jobs. 

It would be easy to argue that a lack of funding is holding India back 
and stopping it from becoming a science superpower. The country 
devotes less than 1% of its gross domestic product to research and devel- 
opment, which puts it far behind emerging nations such as China and 
Brazil, as well as the established economies of the United States and 
Europe. But more money will not cure India’s multiple science ills, as 
Nature documents this week in a special issue on the state of research in 
the country (see page 141). 

One of India’s biggest challenges is to boost its science to help drive 
national development. As a start, it must expand its research workforce. 
But that will require more high-quality universities and appropriate jobs 
for their graduates. The government is taking steps in the right direction. 
It has established tax incentives for research and development that are 
among the best in the world. These have helped to boost research investments
by a few industries, but have yet to drive widespread innovation.

In tandem, India must tackle the bureaucratic morass that is imped- 
ing research and innovation. Scientists complain that funds for grants 
routinely arrive months late and that it can take years to fill positions. 
As a measure of the problem, one-third of the national laboratories, 
which are overseen by the prestigious Council of Scientific and Indus- 
trial Research (CSIR), lack permanent leaders (see page 144). Even the 
CSIR is run by a temporary director-general, Madhukar Garg, who 
told Nature that if the organization continues along these lines, “it will 
affect the national innovation system as a whole”. 

Prime Minister Narendra Modi, like his predecessors, has 
denounced the bureaucratic brakes holding back science, but there 
has been little progress here. A key to solving the issue is to elevate 
talented scientists who have administration experience into positions 
of responsibility. One example is Krishnaswamy VijayRaghavan, who 
is profiled on page 148. He is a gifted geneticist who in 2013 took over 
as head of the Department of Biotechnology, the leading funder of 
bioscience research grants. Among other changes, he is attempting
to streamline the notoriously cumbersome grant-application process.

India could use some help. Compared with some other develop- 
ing nations, it has a relatively low level of international collabora- 
tion, even with the United Kingdom, with which it shares a unique 
history. It bodes well that the new UK minister of universities and 
science, Jo Johnson, has a strong interest in India. In fact, he co-edited 
a book entitled Reconnecting Britain and India: Ideas for an Enhanced
Partnership (Academic Foundation, 2012). 

India does, however, need to look closely at the changes it is making, 
because not all are positive. As part of its effort to encourage devel- 
opment, the Modi administration has tried to silence some critics 
of policies on energy, climate and human rights. In April, the Indian 
government revoked the registrations of thousands of non-governmental
organizations (NGOs) that receive foreign funds, and it has frozen the assets
of Greenpeace over claims that it had violated reporting rules about foreign
contributions. On 6 May, the US ambassador to India, Richard Verma, warned
about “the potentially chilling effects of these regulatory steps focused on
NGOs”.

Some scientists might be tempted to 
applaud India’s clampdown on environmental groups, which have 
stymied certain research initiatives. In March, environmentalists held up 
construction of a major neutrino observatory with debatable claims that 
the facility would harm an aquifer. And Nature reports this week that the 
Modi government has quietly moved forward with trials of genetically 
modified crops, which have long been desired by biotech researchers but 
have been impeded by environmental groups (see page 138). 

But scientists in India should not cheer the government's attempts to 
suppress dissent, even if it helps them to achieve their research goals. It 
would be wrong to blame environmental advocates for India’s lengthy 
and fault-ridden procedures for weighing up the impact of projects. The 
solution is not to silence discussion or to shrink environmental oversight.
Rather, India should make strategic improvements to the environmental
evaluation process that balance progress with protection.




Challenging times 


A European initiative to ban animal research 
has galvanized resistance. 


researchers since it was first proposed in 2012, but its long- 
trailed public hearing this week at the European Parliament in 
Brussels turned out to be a pretty grey affair.

The duo who presented the initiative — which calls for the replace- 
ment of the 2010 European Union (EU) directive on the use of animals 
in scientific research with legislation banning all animal research — 
spoke calmly but unconvincingly. Their extremist claims, that animal 
models have no predictive value for human disease, drew thin and only 
occasional ripples of agreement from a cluster of supporters seated at
the back of the half-filled auditorium. 

The hearing was part of the EU’s move to expand direct democracy by 
introducing European Citizens’ Initiatives, which allow individuals to 
launch requests for legislation. A proposed initiative that collects at least 
one million signatures from at least seven EU countries wins the right 
to a public hearing in the Brussels parliament and obliges the European 
Commission to consider whether new legislation is warranted. 

As Kay Davies, an animal researcher at the University of Oxford, UK, 
wrote in Nature last week, in this case, it is not (see Nature 521, 7; 2015). 

For too long, activists have been left to dominate animal-research 
debates in many European countries. Their frequently inaccurate dec- 
larations — along with their not-infrequent physical attacks and death 
threats — have gone largely unchallenged by the scientific community 
and by the agencies and politicians who support the community's work. 
This has been slowly changing in the past few years, mostly thanks to 
the efforts of UK-based scientists and science organizations, who have 
emerged from their bunkers to set the record straight. 

Germany, despite its status as one of Europe’s major scientific 
powerhouses, has lagged behind in this effort. But a recent incident 
has sparked a remarkable change — one that should shore up support 
to protect the EU directive. 

One of the country’s top neuroscientists, Nikos Logothetis, a director 
at the Max Planck Institute for Biological Cybernetics in Tübingen, last
month gave up a long and painful struggle to maintain his primate labo- 
ratory, which had been targeted by animal activists. Unable to handle 
the death threats and insults to himself and his family, on 22 April he 
told local authorities who handle licences for animal experimentation 
that he would wind down his primate work and continue his work on 
rats only. This would mean reducing the scope of his research questions 
to levels still valuable for understanding general principles of neuronal 
action, but no longer directly translatable to human investigations. 

Logothetis’s problems began last September, when a German 
television channel aired a documentary using footage of his 
macaque monkeys secretly filmed by an animal-activist infiltrator. It 
seemed to show maltreatment of the animals. The resulting scandal 
led to a series of investigations that exonerated him and suggested
that the behaviour of the monkeys had been staged for the camera.
A police investigation is still going on.

His decision to quit primate work dismayed many of his colleagues, 
and so did its timing. Coming so close to this week's public hearing, 
they feared that it would be presented as a victory for the Stop Vivisec- 
tion proponents. But something quite different happened: a swell of 
support for Logothetis and the type of primate research he carries out. 

First, politicians at the highest levels reacted with unprecedented
speed and clarity to mount an unambiguous defence of the scientific
use of non-human primate research. The research minister in the state
government of Baden-Württemberg, where Tübingen is located, condemned
as cynical and exploitative the wild claims that Logothetis’s decision
implied that research with monkeys was not after all necessary. The
federal research minister stated that such research was still crucial
for developing treatments for brain disorders such as dementias.

And a new policy of the Max Planck Society to be more open about 
its animal research showed its teeth. On 30 April, the society released a 
statement of regret about Logothetis’s decision and confirmed its own 
commitment to continue supporting research using non-human pri- 
mates. The society’s president, Martin Stratmann, a materials scientist 
who took office last year and who has selected the handling of the ani- 
mal issue as a priority for 2015, spoke out to pledge greater protection of 
its researchers against attack. At the grass roots, colleagues in Tübingen
launched a petition to support Logothetis that has received more than 
4,000 signatures from scientists around the world. 

This outspoken support has been echoed elsewhere. Parliamentary 
debates on animal research in Italy this week, where animal groups 
have been particularly active in the past few years, questioned rather 
than accepted animal-activist claims. Sixteen European Nobel laureates 
published an open letter in UK and German newspapers to rebut the 
Stop Vivisection campaign, joining a similar statement by 149 major 
research organizations and patient groups. 

The European Parliament has until 3 June to decide what to do. It 
should listen to the loud and unified voice of the continent’s scientists, 
and then do precisely nothing.


Polls apart 


The UK voter opinion polls show that an 
anomalous answer can be the correct one. 


Britain’s new Conservative government has barely settled into
office, but already the results of last week’s general election have
got certain members of UK society fearing for their future. They
are scorned by the tabloid press and social media; even serious observers
are questioning whether the country has been in thrall to them for
too long. An inquiry has already been announced.

Opinion pollsters, the media told everyone, were predicting the
closest election for decades. Labour and the Conservatives were neck
and neck; weeks of constitutional chaos would follow the election as
mandarins and officials wrestled with competing and overlapping
political claims to power. The small print says that opinion polls should
always be taken with a decent pinch of salt. But who reads the small
print when there is an election on and a 24-hour news cycle to fill?

It took a single poll of voters post-voting to reveal the truth, which
was confirmed as the counted results flooded in: David Cameron's
Conservative Party had grabbed 37% of the vote (see page 134). That
was nearly seven percentage points ahead of Labour and, crucially,
well outside the margins of error of all the previous deadlocked polls.

Amid the fallout, a single polling firm revealed that it had correctly
predicted — and then buried — the result. Gathered the day before the
election, its poll results seemed so out of line with what everyone else
was saying that the firm did not dare to publish them. “I chickened out of
publishing the figures,” confessed Damian Lyons Lowe, the chief executive
of Survation in London. “Something I’m sure I'll always regret.”

Nature’s readers can surely sympathize. The question of how to deal 
with anomalous data is a centrepiece of research, and the results can 
make or break careers — or launch scientific revolutions. From the dis- 
covery of the ozone hole over Antarctica to the observation that some 
people seemed unaffected by HIV infection, unusual results — data 
that make you go ‘hmmm’ — have led scientists to question their methods,
their knowledge and, ultimately, their understanding of the world.

The importance of anomalies in science has spawned its own 
sub-field of research into how researchers respond to them. In the 
mid-1980s, psychologists supported by US military funds went as far 
as constructing a bespoke computer program to recreate how Hans 
Krebs reacted to surprising results during his discovery of the urea 
cycle in 1932. Others conduct in vivo studies by filming astronomers 
and physicists as they wrestle with unexpected findings. 

The ultimate test of anomalous data is, of course, to repeat the 
experiment. But that demands that scientists 
have the courage and insight to treat such results 
seriously in the first place. How many potential 
discoveries lie in the waste-paper bin of history 
because the cautious chickened out?
WORLD VIEW


Regulate gene editing
in wild animals


The use of genome-modification tools in wild species must be properly
governed to avoid irreversible damage to ecosystems, says Jeantine Lunshof.


Gene editing is a hot topic following a flurry of interest in the use
of CRISPR tools to modify human embryos. As an ethicist in
a genome-engineering lab, I am an eyewitness to these recent
scientific developments and I do have concerns about the way gene
editing could be used. But they are not the concerns you might expect.

The ethical issues raised by human germline engineering are not new.
They deserve consideration, but outcry over designer babies and precision
gene therapy should not blind us to a much more pressing problem:
the increasing use of CRISPR to edit the genomes of wild animal populations.
Unless properly regulated and contained, this research has the
potential to rapidly alter ecosystems in irreversible and damaging ways.

Scientists have already used CRISPR to modify mosquitoes and the
fruit fly Drosophila melanogaster. And in combination with another
molecular-biology technique called gene drive,
they have found a way to massively increase the
efficiency of spreading these transformations
to offspring and through the population. Once
introduced, these genetic changes are self-propagating.
If released beyond the laboratory, the
effects would spread with every new generation
and would quickly run out of control.

Gene drive achieves rapid changes in a sexually
reproducing population because it relies
on genes that are capable of preferential spread
through generations. Without this, introduced
traits meet the statistical obstacle of Mendelian
inheritance and take hold in a population much
more slowly. Altering wild animal populations
using gene drive aims to rapidly disrupt a particular
trait, such as the ability of Anopheles
mosquitoes to transmit malaria. It makes only a
small-scale initial change to the relevant ecosystem and, in this example,
the preliminary disruption would be restricted to the mosquito’s natural
habitat. But the risk of broader ecosystem disruption is unknown and
would require extensive mathematical modelling to estimate.

The gene-drive technique was developed in our lab, and in the initial
publication of the method, my colleagues called for strict and validated
biosafety measures and public review and consent (K. M. Esvelt
et al. eLife 3, e03401; 2015). Meanwhile, work that combines CRISPR
and gene-drive techniques is marching on. In what they call a “mutagenic
chain reaction”, scientists at the University of California, San
Diego, have used the combined approach to alter D. melanogaster
(V. M. Gantz and E. Bier Science 348, 442-444; 2015). To me and
others, this research raises serious and significant fears about biosafety.
The work was done in a lab, but should any of the


certainly be allowed, but only under the strictest conditions and with 
appropriate safeguards. 

In less than three years, CRISPR has become a key tool for biologists. 
‘Should they stop before it is too late?’ is therefore an immaterial question. 
Careful assessment of the various applications of CRISPR shows that there 
is no single or universal ethical evaluation that could cover all of them. 
The different consequences of human genome modification in either 
somatic cells or the germ line, and the modification of the ecosystem 
through gene drives, call for different ethical and policy evaluations. 

Some critics argue that the unpredictable effects that human germline 
genome editing could have on future generations make it dangerous 
and ethically unacceptable. Uncertainty, however, is not a useful way 
to judge ethical acceptability. Others highlight the potential non- 
therapeutic purposes of germline modification. 
From the standpoint of ethics, it is not clear why 
trait modification is by definition a bad thing. 
Moreover, the criteria for what is therapy and 
what is enhancement are fluid. 

The consequences of modifying human 
genomes will be limited because effects will 
always be restricted to humans — the index 
person and their line of descendants. Biosafety 
and biosecurity risks are not apparent at this 
moment. Regulation, if and when it comes, 
might need to be adapted to the local situa- 
tion, to existing legislation and to cultural and 
religious normative frameworks. 

Presented in those terms, the human appli- 
cations of CRISPR are much less troubling than 
the possibility of ecosystem modification. By 
definition, such disruption has more severe, 
complex, system-level consequences, and the breadth of its impact and 
the duration of its effects are hard to model. Gene drives are designed 
to be reversible, but this still needs to be tested. Self-propagating modi- 
fied organisms cannot be contained within national borders and pose 
major challenges for regulation and governance. We therefore need an 
urgent review of biosafety and biosecurity protocols for experiments 
— both in the lab and in field-scale trials — that combine CRISPR and 
gene-drive techniques in wild organisms. Funders and institutions 
must lay out and enforce regulations. 
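
The contrast drawn earlier between gene drive and ordinary Mendelian inheritance can be illustrated with a toy calculation. The sketch below is not taken from the cited papers. It assumes an idealized, very large, randomly mating population, no fitness cost, and a hypothetical `conversion` parameter: the fraction of heterozygotes in which the drive copies itself onto the partner chromosome. Under those assumptions a neutral Mendelian allele (conversion 0) stays at its starting frequency, whereas a perfectly efficient drive (conversion 1) roughly doubles in frequency each generation while it is rare.

```python
def next_frequency(q, conversion):
    """Allele frequency after one generation of random mating.

    Assumptions (illustrative only): infinite population, no fitness
    cost, and heterozygotes transmit the allele with probability
    (1 + conversion) / 2 instead of the Mendelian 1/2.
    """
    return q + conversion * q * (1.0 - q)


def generations_to_reach(q0, conversion, target, max_generations=1000):
    """Count generations until the allele frequency reaches `target`.

    Returns None if the target is not reached within `max_generations`.
    """
    q = q0
    for generation in range(max_generations + 1):
        if q >= target:
            return generation
        q = next_frequency(q, conversion)
    return None


if __name__ == "__main__":
    start = 0.01  # hypothetical release: 1% of alleles carry the change
    print("drive:", generations_to_reach(start, 1.0, 0.99))
    print("Mendelian:", generations_to_reach(start, 0.0, 0.99))
```

In this simplified model, a 1% release with perfect conversion passes 99% in nine generations, whereas the neutral Mendelian allele never gets there. Real populations, with fitness costs, resistance and spatial structure, would behave differently, which is why the text calls for extensive modelling.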

The work with gene editing has thrown a useful spotlight on these 
bioengineering tools. But from an ethical perspective, the question we 
should ask is not what CRISPR can do for humans, but what humans 
can do with CRISPR.


Jeantine Lunshof is assistant professor at the University of Groningen, 
the Netherlands, and a visiting scientist at Harvard Medical School, 
Boston, Massachusetts, USA. 

e-mail: jelunshof@genetics.med.harvard.edu 
RESEARCH HIGHLIGHTS


Selections from the 
scientific literature 


Crystal harbours 
molecular shuttle 


A ring-shaped molecule that 
hops between two sites inside 
a porous crystal is the first 
molecular shuttle to operate in 
a solid-state material. 

Molecular shuttles could 
one day act as switches to 
store data if they are held in a
well-ordered array. A team led
by Stephen Loeb and Robert
Schurko at the University
of Windsor, Canada, built
their shuttle inside a metal-organic
framework (MOF):
a crystalline scaffold made
from metal-containing nodes 
connected by carbon-based 
struts. These struts bore a 
circular rotaxane molecule 
that moved back and forth 
283 times per second at room 
temperature. 

The estimated high density 
of shuttles in the MOF shows 
the potential for enormous data 
capacity if such switches can be 
controlled, the authors say. 
Nature Chem. http://doi.org/4f2 
(2015) 


Gut biofilms could 
spur cancer 


Chemicals secreted by gut 
bacteria are linked to human 
colon cancers. 

Metabolites called 
polyamines are made by 
gut bacteria to help them 
to form sticky aggregates 
called biofilms, and are used 
by human cells to regulate 
proliferation. Cynthia Sears 
at Johns Hopkins University 
in Baltimore, Maryland, Gary 
Siuzdak at the Scripps Research 
Institute in La Jolla, California, 
and their colleagues compared 
tissue samples from human 
colon cancers to those from 
healthy people, both with and 
without biofilms. 


PALAEONTOLOGY 
Early modern bird 


Two newly discovered fossils from China are the oldest
relative of modern birds found so far, and had relatively few
traits of their dinosaur ancestors.


Min Wang and Zhonghe Zhou at the Institute of Vertebrate 
Paleontology and Paleoanthropology in Beijing and their 
colleagues describe two fossils from northeast China dating 
to 130.7 million years ago. The new species, Archaeornithura 
meemannae (pictured), boasted elaborate plumage — 
including a fan-shaped tail — and other defining features of 
modern birds such as a wishbone. This suggests that even these 
early birds, which are six million years older than previously 
discovered bird fossils, had already shed key dinosaur traits. 

A. meemannae belongs to a family of wading birds, hinting 
that modern birds originated near water, the team says. 


Nature Commun. 6, 6987 (2015) 


They found that cancer 
tissue with biofilms had 
62 times more of the 
polyamine metabolite N¹,N¹²-diacetylspermine
than did
healthy tissue with biofilms. 
Yet in samples that were 
biofilm-free, the cancer tissue 
contained only around 7 times 
more polyamine than the 
healthy sample. Antibiotic
treatment reduced levels of this 
metabolite, suggesting that it 
comes from bacteria. 

Therapies that target 
polyamine formation and 
biofilms could be a way to treat 
colon cancer, the authors note. 
Cell Metab. http://doi.org/4jz 
(2015) 
Growing extremes
in California rains 


California's ongoing drought 
is a result of natural variability, 
at least for the 2013-14 period. 
But the state could see larger 
swings in wet and dry seasons 
by the end of this century 
owing to climate change. 

Neil Berg and Alex Hall of 
the University of California, 
Los Angeles, used 34 climate 
models to study how 
precipitation extremes might 
change in California. They 
expect that between 2060 and 
2100, the normally wet winter 
will be extremely dry twice as 
often as today, and extremely 
wet three times as often. The 
fluctuations could raise the risk 
of drought and flooding. 

These changes could push 
the state’s water supply to its 
limit. 

J. Clim. http://doi.org/4fz (2015) 


EVOLUTION 


Bird beak to 
dinosaur snout 


Chicken embryos with 
dinosaur-like faces provide 
clues as to how bird beaks 
evolved from dinosaur snouts. 
Early in bird evolution, 
the twin bones that form the 
snout in dinosaurs and reptiles 
— the premaxillae — grew 
longer and joined together, 
eventually forming the beak. 
Bhart-Anjan Bhullar, now 
at the University of Chicago 
in Illinois, Arhat Abzhanov 
at Harvard University in 
Cambridge, Massachusetts, 
and their colleagues analysed 
the development of beaks in 
embryonic chickens and emus, 
and snout development in 
reptiles such as alligators. 
They found that two proteins 
involved in facial development, 
FGF and Wnt, might have a 
role in beak evolution. When 
they inhibited these genes in
developing chicken eggs —
making the expression pattern
more like that of reptiles
and other vertebrates — the
premaxillae were separate and 
much shorter in some chicken 
embryos than in others. (None 
was allowed to hatch.) 

X-ray scans of the embryonic 
skulls showed that they more 
closely resembled the bones of 
early birds and dinosaurs than 
those of modern chickens. 
Evolution http://dx.doi.org/ 
10.1111/evo.12684 (2015) 


Ants dig differently 
depending on dirt 


Fire ants adjust their digging 
behaviour when building 
underground nests, depending 
on the size and dampness of 
the material they are working 
with. 

Daniel Goldman and his 
colleagues at the Georgia 
Institute of Technology in 
Atlanta studied fire ants 
(Solenopsis invicta; pictured) 
as they constructed tunnel 
networks in silica particles 
designed to mimic soil of 
varying moisture levels and 
particle size. Using X-ray 
imaging, the team found that 
up to a point, ants dug deeper 
tunnels through clay-like and 
fine-grained particles as the 
moisture content increased, 
using the growing stickiness 
of the particles. When 
digging, the ants either carried 
individual grains away if they 
were large, or gathered and 
compressed smaller particles 
into a pellet. 

The authors say that this 
adaptability could help to 
explain how this species has 
successfully colonized large
parts of North America.
J. Exp. Biol. 218, 1295-1305 
(2015) 


Farthest galaxy 
measured 


Astronomers have observed a 
distant galaxy as it looked just 
650 million years after the Big 
Bang, making it the farthest 
galaxy to have its distance 
reliably measured. 

Pascal Oesch at Yale 
University in New Haven, 
Connecticut, and his 
colleagues used a telescope at 
the W. M. Keck Observatory 
in Hawaii to study the galaxy 
EGS-zs8-1. To gauge its age, 
they measured how much its 
light had been stretched, or 
redshifted, as the light travelled 
across the expanding Universe. 
They found that the object is 
surprisingly bright and massive 
for such a young galaxy — a 
sign of rapid star formation. 

Their analysis adds weight 
to a theory that the peculiar
colours of early galaxies, 
compared to later objects, may 
be the result of interaction 
between these rapidly forming 
stars and the gas around them. 
Astrophys. J. Lett. 804, L30 (2015) 


Silenced gene 
keeps malaria out 


Researchers have uncovered a 
possible target for anti-malarial 
therapies by suppressing gene 
expression in blood stem cells. 

Malaria-causing parasites 
attack mature red blood cells, 
which lack DNA, making it 
hard to test which genes make 
the cells vulnerable to malaria 
infection. Manoj Duraisingh 
at the Harvard T. H. Chan 
School of Public Health in 
Boston, Massachusetts, and his 
colleagues have sidestepped 
this problem by studying the 
stem cells that develop into red 
blood cells and contain DNA. 

They used RNA molecules 
to knock down gene activity in 
the stem cells, induced them 
to develop into red blood cells 
and then exposed them to 
SOCIAL SELECTION


Popular articles 
on social media 


Mystery of telescope signals solved 


A report on the surprising origins of rogue signals picked up 
by a radio telescope has been simmering on social media.
After more than four years, researchers using the Parkes radio 
telescope in New South Wales, Australia, have identified the 
source of the mysterious signals: a microwave oven in the 
facility's break room. The news quickly spread on Twitter. 
“And the result of the story: don’t use microwaves next to 
radio telescopes!” tweeted Karina Voggel, an astronomy PhD 
student at the European Southern Observatory in Garching, 
Germany. “A nice reminder that not all radio transients are 
microwave ovens — even if some are,” tweeted Chris Lintott,
an astrophysicist at the University of Oxford, UK. 
http://arXiv.org/abs/1504.02165v1 (2015) 


Based on data from altmetric.com. 
Altmetric is supported by Macmillan 
Science and Education, which owns 
Nature Publishing Group. 


Plasmodium falciparum — the 
most deadly malaria pathogen. 
They found that the parasite 
binds to a cell-surface protein 
called CD55 and could not 
infect cells that did not produce 
it. Red blood cells can function 
normally without CD55, 
suggesting that the protein 
could be a therapeutic target. 
Science 348, 711-714 (2015) 


EPIGENETICS 


Mammary cells 
have a memory 


Specific DNA changes make 
the mammary glands in 
pregnant mice gear up for milk 
production faster if the animals 
have been pregnant before. 
Many women report that 
breast-feeding becomes 
easier with each pregnancy. 
To find out why, Gregory 
Hannon at Cold Spring Harbor 
Laboratory in New York and 
his colleagues gave pregnancy 
hormones to female mice and
monitored changes in their 
mammary glands over time. 

In mice that had been 
pregnant once before, the milk- 
producing ducts expanded 
in number more quickly 
(pictured, right) and generated 
milk proteins sooner than the 
ducts of mice that were first- 
time mothers (pictured, left). 
Genetic analysis of mammary- 
gland cells after the animals 
had stopped lactating showed 
that a previous pregnancy 
resulted in a long-term loss of 
methyl groups from DNA in 
genes that are activated during 
lactation. 

This epigenetic ‘memory’ 
primes these genes for fast 
reactivation in a subsequent 
pregnancy, the authors say. 

Cell Rep. http://doi.org/4j2 (2015) 
UK election impact


The incumbent Prime
Minister David Cameron
and his Conservative Party
won the United Kingdom's
election on 7 May with a
surprise outright majority that
flummoxed pollsters. The win
has implications for research
because ministers are likely to
emphasize austerity allied to
economic growth — a pillar of
the previous Conservative-led
coalition government. There
are no promises to protect 
science funding, which has 
been frozen for five years. 
The Conservative pledge to 
hold a 2017 referendum on 
Britain withdrawing from 
the European Union could 
adversely affect UK science if 


RAY BARKER 

Nature’s viability and health 
is as much dependent on its 
business leaders as on its 
editors. It is with sorrow that 
we record the death of Ray 
Barker (1936-2015), who 
was managing director of 
Macmillan Magazines from 
1991 to 1999. He had a deep 
knowledge of publishing, 
and as publisher of Nature 
he increased financial 
robustness following a period 
of weakness. He also oversaw 
substantial growth of the 
stable of Nature research 
journals. His colleagues 
remember his exceptional 
ability to recognize their 
contributions; according to 
Stefan von Holtzbrinck, of 
the publishing family that 
bought Macmillan during 
Ray’s tenure, “Many of the 
company’s achievements 
stemmed from Ray’s 
unique gift to spot and train 
extraordinary talents. His 
charisma inspired everyone 
to do the best possible job 
and go the last mile.” 
Liberia celebrates becoming Ebola free


Liberia has become the first of the three main 
countries affected by Ebola to be declared 
officially free of the disease, ending its 
15-month epidemic. The last person in Liberia 
known to have Ebola died on 27 March and 
was buried the next day. This means that, as of 
9 May, the country has had no new cases for 


the country leaves. Jo Johnson, 
a modern-history graduate, 
former banker and journalist, 
has been appointed as Minister 
for Universities and Science. 
See page 134 for more. 


42 days (twice the maximum incubation time) 
since the last burial — the criterion used by 
the World Health Organization (WHO) for 
declaring a country Ebola free. But with cases 
continuing in Sierra Leone and Guinea, the 
WHO has warned against complacency. See 
go.nature.com/mhimlt for more. 


Arab Mars mission 
The United Arab Emirates 
plans to launch a spacecraft 
to Mars in 2020, it announced 
on 6 May. The probe, named 
Hope, will be the first Arab 
spacecraft to go to the red 
planet. It will aim to model 
Mars’s atmosphere from 
an elliptical orbit, studying 
temperature, gases and 
weather phenomena on 
each pass. It will be managed 
by the UAE Space Agency 
and controlled from the 
Mohammed bin Rashid Space 
Centre in Dubai. Partners 
in the mission include the 
University of Colorado 
Boulder, the University of 
California, Berkeley, and 
Arizona State University in 
Tempe. 


BUSINESS 

Mega-merger bid 

Monsanto announced on 
8 May that it had attempted 
to buy another leading 
agricultural-technology 
company, Syngenta of 
Basel, Switzerland. The 
offer — 449 Swiss francs 
(US$481) per Syngenta share, 
or about US$45 billion in total 
— was rejected, with Syngenta 
arguing that Monsanto, based 
in St Louis, Missouri, had 
undervalued the firm. If the 
purchase were to be successful 
eventually, it would unite 
Syngenta’s business, a leader 
in pesticides and herbicides, 
with Monsanto’s strong seed 
business, which trades in 
both conventionally bred and 
genetically engineered crops. 
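

The 42-day criterion in the Liberia item can be checked with simple date arithmetic. The sketch below assumes a maximum incubation time of 21 days (implied by the item's "twice the maximum incubation time") and takes the burial date as 28 March 2015, the day after the last reported death; the function and constant names are illustrative only.

```python
from datetime import date, timedelta

# Assumption: 42 days is twice the maximum incubation time, so that time is 21 days.
MAX_INCUBATION_DAYS = 21


def ebola_free_date(last_burial: date, max_incubation_days: int = MAX_INCUBATION_DAYS) -> date:
    """Return the first date on which twice the maximum incubation time has elapsed."""
    return last_burial + timedelta(days=2 * max_incubation_days)


# The last known patient died on 27 March 2015 and was buried the next day.
last_burial = date(2015, 3, 28)
declared = ebola_free_date(last_burial)
print(declared.isoformat())  # 2015-05-09
```

The result matches the 9 May date given in the item.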


Drug-use panel 
Johnson & Johnson has set up 
an independent bioethics panel 
to decide whether to grant 
patients access to experimental 
medicines, the New Jersey 
drug firm said on 7 May. 

The panel will be chaired by 
Arthur Caplan, a bioethicist 

at New York University. It 

will operate on a trial basis 

to consider compassionate- 
use requests for a single, 
experimental drug that is 
being developed by the firm’s 
subsidiary Janssen. More than 
20 US states are considering 
‘right to try’ laws that would 
encourage drug companies to 
offer investigational medicines 
to patients. See go.nature.com/84wobé6 
for more. 


SOURCE: THE FUTURE OF SOLAR ENERGY (MIT, 2015) 


Rare-disease deal 


Alexion Pharmaceuticals in 
Cheshire, Connecticut, has 
agreed to pay US$8.4 billion 
to buy Synageva BioPharma 
in Lexington, Massachusetts. 
Synageva is developing 
a drug for a rare genetic 
disease called lysosomal 
acid lipase deficiency; the 
therapy is in late-stage clinical 
trials. The deal, announced 
on 7 May, highlights the 
pharmaceutical industry's 
growing interest in treatments 
for rare conditions, and will 
expand Alexion’s repertoire 
of therapies for metabolic 
diseases. 


Escape-pod test 


Private spaceflight company 
SpaceX passed a crucial test 
by successfully deploying 
the emergency escape 
pod of its Crew Dragon 
rocket system. The capsule 
launched (pictured) from 
Cape Canaveral, Florida, on 
6 May. As planned, it flew 
1.5 kilometres above the 
Atlantic Ocean before the 
escape pod detached, unfurling 
three parachutes to splash 
down safely into the sea. 
Sensors aboard the pod showed 
that the conditions would have 
been safe for humans. SpaceX, 
which is headquartered in 
Hawthorne, California, is 
expected to start shuttling 
astronauts into space in 2017. 


TREND WATCH 

The US Department of Energy 
should create a sustained and 
predictable programme of 
funding focused on new solar 
technologies, to support a 
“massive expansion” of solar 
power by mid-century, says a 
report from the Massachusetts 
Institute of Technology in 
Cambridge (see go.nature.com/ogv7wa). 
Released on 5 May, it 
urges policies to promote solar 
energy, including subsidies for 
generating technologies and 
incentivized programmes for 
developing low-cost technologies. 


POLICY 


Disease monikers 
The World Health 
Organization (WHO) 

has issued guidelines for 
naming new diseases. The 
rules, released on 8 May, 
recommend naming diseases 
according to what they 

do rather than after the 
individual, region or species 
in which they were found. 
The name swine flu, for 
instance, has hurt the pork 
industry even though the 
virus is not specific to pigs, 
and Middle Eastern countries 
have been concerned that the 
name of the viral infection 
Middle East Respiratory 
Syndrome will lead to 
stigmatization. Researchers 
should also avoid names with 
frightening words such as 
‘fatal’, the WHO said. 


SOLAR SUPPORT 


Symbolic CO₂ peak 
The global monthly 
concentration of atmospheric 
carbon dioxide has exceeded 
400 parts per million (p.p.m.) 
for the first time since agencies 
began to track it in the 1960s. 
Analysing air samples from 
40 sites worldwide, the 
US National Oceanic and 
Atmospheric Administration 
calculated an average CO₂ 
level of 400.83 p.p.m. for 
March. Calibrated data for 
April are not yet available, 
but scientists expect CO₂ 
concentration to remain 
above 400 p.p.m. for April 
and throughout May, when it 
normally peaks. Daily levels 
first surpassed 400 p.p.m. in 
2012 at Arctic sites. 


Tailored tests 


Personalized screening and 
medicines could save the 
United States billions of dollars, 
but such interventions are 
unlikely to emerge because 
of the structure of its health- 
care system, says a panel of 
doctors and economists in The 
Lancet (V. J. Dzau et al. Lancet 
http://doi.org/4km; 2015). 
Modelling by the group, led by 
Victor Dzau of the Institute of 
Medicine in Washington DC, 
found that a 10% fall in diabetes 
and cancer cases as a result 
of using tailored tests would 
generate US$166 billion in 
improved health over 50 years. 
But the group notes that private 
insurers cover therapies by 
judging only short-term gains. 


The US government has a long history of funding solar-energy 
technologies, but will need to raise investment in the future. 
[Chart: spending by US Department of Energy (2014 US$ millions) 
on photovoltaic technology and concentrated solar-power technology.] 


SEVEN DAYS | THIS WEEK 


16–20 MAY 
Scientists and health- 
care professionals 
meet in Dublin for the 
European Congress 
of Endocrinology 
to discuss the latest 
advances in the field. 
go.nature.com/p5mtoq 


18–22 MAY 
The latest research in 
the science of sound 
is presented at the 
Acoustical Society 
of America’s spring 
meeting in Pittsburgh, 
Pennsylvania. 
go.nature.com/wseczh 


18–26 MAY 
The World Health 
Organization holds its 
general assembly in 
Geneva, Switzerland. 
Delegates from all its 
member states discuss 
and decide on health 
policies, programmes 
and budgets. 
go.nature.com/kiqaym 


High-energy ties 
The United States and CERN, 
Europe's high-energy physics 
lab near Geneva, Switzerland, 
pledged to align their long- 
term strategies for particle 
physics on 6 May. Although 
the United States has only 
observer status in CERN, 

it is a major contributor to 
experiments such as those at 
the Large Hadron Collider. 
CERN will now reciprocate 
with more-direct involvement 
in US-based experiments, 
notably in neutrino science. 
Also on 6 May, Turkey 
formalized its associate 
membership of CERN, which 
grants it access to CERN 
Council meetings. 


NATURE.COM 
For daily news updates see: 
www.nature.com/news 


14 MAY 2015 | VOL 521 | NATURE | 131 


© 2015 Macmillan Publishers Limited. All rights reserved 


PRABIN POKHREL/NEPAL POLICE/HANDOUT/EPA 


NEWS IN FOCUS 


How UK election surprises will shape science p.134 
Haze of antimatter found in thunderclouds p.135 
Body’s microbial DNA raises privacy concerns p.136 
A look at the nation’s battle to improve science and innovation p.144 


The village of Langtang was buried by a landslide last month after a major earthquake struck Nepal. 
Mappers rush to pinpoint 
landslide risk in Nepal 


Geologists say hazard posed by earthquake-loosened earth could linger for years. 


BY ALEXANDRA WITZE 


As Nepal digs out from the devastating 
earthquake of 25 April, 
researchers are braced for the next 
geological hazard. In June, monsoon rains will 
begin to drench the hillsides destabilized by the 
quake, raising the risk of disastrous landslides. 

Geologists are racing to identify areas that 
could collapse and bury villages or block impor- 
tant roads. They say that the hazard is likely to 
linger for years. 

So far, the situation has been grim but better 
than expected. Scientists have identified thou- 
sands of landslides in satellite imagery, rather 
than the tens or hundreds of thousands they 
anticipated from the aftermaths of other large 
earthquakes in mountainous areas. “Something 
slightly mysterious has happened,” says David 
Petley, a geoscientist at the University of East 
Anglia in Norwich, UK. “The landslide problem 
is far from trivial, but it’s not as desperately seri- 
ous as we might have feared.” 

The most violent event occurred in the 
Langtang valley, a popular trekking area in the 
mountains north of Kathmandu. Part of a gla- 
cier above Langtang village broke off and plum- 
meted into the valley below. Witnesses report a 
powerful wind blasting snow, dust and building 
fragments over the village, suggesting that the 
avalanche was so powerful that it sent a pressure 
wave racing outwards, says Dorothea Stumm, a 
glaciologist at the International Centre for Integrated 
Mountain Development in Kathmandu. 
Satellite images reveal that a giant swathe of 
mountainside has been obliterated, right down 
to the river on the valley floor. 

Elsewhere, Nepalese authorities worry that 
landslides might block rivers. This can form 
lakes that obstruct major roads, force peo- 
ple to evacuate their homes or even burst in a 
single catastrophic event. Last August, a land- 
slide triggered by heavy rains dammed the Sun 
Kosi River in northern Nepal, killing roughly 
150 people and causing widespread flooding. 

At least four groups are using government 
and commercial satellite imagery to map such 
hazards. The recent quake caused landslides 
that blocked valleys, including several 
along the Trisuli River, which runs between 
Nepal and Tibet. “It looks to be quite risky 
there at the moment,” says Nick Rosser, a 
landslide expert at Durham University, UK. 
“This will be the area of biggest impact when 
the monsoon starts, as rainfall totals there 
are among some of the highest in the country.” 
In preparation, a team from the Chinese 
Academy of Sciences has been surveying 
landslide sites along roads that lead to Tibet. 
Another area of concern is a landslide-cre- 
ated lake on the Marshyangdi River, which 
runs above the Annapurna trekking circuit. 

The monsoon in Nepal typically lasts from 
June to September, and fatal landslides hap- 
pen mostly during that period. The amount 
of monsoon rainfall varies dramatically from 
year to year (D.N. Petley et al. Nat. Hazards 
43, 23-44; 2007). The fact that the earth- 
quake struck in April may have been some- 
thing of a saving grace, because dry soils are 
harder to dislodge than is wet ground. Many 
more landslides would have happened had 
the quake struck just a few months later, 
says Binod Tiwari, a geotechnical engineer 
at California State University in Fullerton. 

The country was at high risk of landslides 
even before the tremor. Nepal rides atop 
the ongoing collision between India and 
Asia, a geological bust-up that pushes the 
Himalayas to ever-greater heights. The rug- 
ged terrain, unstable soils, heavy rains and 
mountain communities combine to make it 
one of the world’s landslide hotspots. 

The earthquake, now named the Gorkha 
quake, has worsened the situation. It rup- 
tured the main Himalayan geological fault 
to the northwest of Kathmandu. By 11 May, 
more than 8,000 people had been confirmed 
dead — although that is many thousands 
fewer than experts had projected. Buildings 
in Kathmandu may have been sturdier than 
thought, or the ground may not have shaken 
quite as strongly as would be expected from 
the quake's magnitude. 

Gentler shaking would also help to explain 
the relative paucity of landslides, says Marin 
Clark, a geomorphologist at the University 
of Michigan in Ann Arbor. “Either that, or 
the rocks are stronger than we estimated,” 
she says. 

But the risk of landslides remains high. 
After the magnitude-7.6 Chi-chi earth- 
quake in Taiwan in 1999 and the magni- 
tude-7.9 Sichuan earthquake in mainland 
China in 2008, the number of landslides 
soared for years as sediment continued to 
shift in fresh debris flows (B. Yu et al. Eng. 
Geol. 182, 130-135; 2014). Authorities in 
Nepal need to prepare for the monsoon 
season by inspecting and monitoring the 
places most at risk, Rosser says. “That’s 
where we need to be focusing,” he says. 
“Places where we have landslides and there's 
a population.” ■ 
What the UK election 
means for science 


A Conservative majority, Scottish National Party rise and 
Liberal Democrat losses all have implications for research. 


BY ELIZABETH GIBNEY 


From an outright majority in Parliament 
for the Conservatives, to the decimation of 
the Liberal Democrats and the rise of the 
Scottish National Party, the UK general election 
on 7 May was full of surprises — many of which 
will have implications for science. 

Scientists should expect an emphasis on 
austerity allied to economic growth — a pillar 
of the Conservative-led government of the 
past five years, which ruled in coalition with 
the Liberal Democrats. During that time, the 
science budget was frozen, and dropped in real 
terms. But there is support for science in Parliament, 
and an understanding of its relation 
to the economy, says Paul Nightingale, deputy 
director of the Science Policy Research Unit 
at the University of Sussex, UK. So cutting the 
research budget would be a “hard sell”, he says. 
Instead, he expects “more explicit attempts to 
align research with economic growth”. 

Before the election, the Conservatives 
pledged to seek a “strong” deal at the United 
Nations climate negotiations in December 
that “keeps the goal of limiting global warming 
to two-degrees firmly in reach”. The party 
also promised to end support for onshore wind 
farms and to encourage expansion of nuclear 
power and gas, including fracking. 


Former Liberal Democrat MP Julian Huppert was a 
vocal supporter of science. 
Nick Hillman, director of the Higher 
Education Policy Institute in Oxford, UK, says 
that the reappointment of Theresa May as 
Home Secretary may trouble scientists. The 
last government's tough stance on immigra- 
tion included cutting the post-study work visa 
for international students (see Nature 506, 
14-15; 2014). But he sees promise in the new 
science and universities minister, Jo Johnson, 
younger brother of the Mayor of London and 
new Member of Parliament (MP), Boris John- 
son. Although not a scientist, Jo Johnson is 
pro-European and close to the Chancellor of 
the Exchequer, George Osborne. 

A certain outcome of the Conservative win 
is a referendum by 2017 on whether to leave 
the European Union. Nightingale suspects that 
people will vote to stay in. If Britain did leave, 
it would probably not be cut out of European 
research programmes, says Kieron Flanagan, a 
science-policy researcher at Manchester Busi- 
ness School. However, it would feel the loss of 
cash from a different European pot, which Brit- 
ain uses to fund science-related infrastructure. 

Meanwhile, the Scottish National Party’s 
increased representation from 6 to 56 seats 
may affect science across the United Kingdom. 
Growing Scottish nationalism is likely to fur- 
ther a focus on regional development that was 
part of the Conservative Party’s manifesto, says 
Nightingale. This could bolster a trend to allo- 
cate science funding directly from the Treasury 
in London to regional projects, such as the UK 
National Graphene Institute in Manchester, 
he says, rather than through national funding 
agencies. However, David Price, vice-provost 
for research at University College London, 
warns that without an increase in the science 
budget, the regional agenda would be pointless. 

Another defining moment was the crash in 
support for the Liberal Democrats, who lost 49 
of their previous 57 seats, including some high- 
profile MPs. Keenly felt by many scientists was 
the loss of Julian Huppert, the former Liberal 
Democrat MP for Cambridge. Previously a bio- 
chemist at the University of Cambridge, he was 
vocal on science issues and popular with scien- 
tists. On 8 May, Huppert teasingly expressed 
fears about the future: “Scientists love control 
experiments — but I certainly didn't want a Tory 
government to show how effective the Liberal 
Democrats actually were in government.” ■ 
Lightning is only the most visible product of clouds’ intense electric fields. 


Rogue antimatter 
found in clouds 


Aeroplane detects signature spike in thundercloud photons 
that does not fit any known source of antiparticles. 


BY DAVIDE CASTELVECCHI 


When Joseph Dwyer’s aeroplane 
took a wrong turn into a thunder- 
cloud, the mistake paid off: the 
atmospheric physicist flew not only through a 
frightening storm but also into an unexpected 
— and mysterious — haze of antimatter. 

Although powerful storms have been known 
to produce positrons — the antimatter ver- 
sions of electrons — the antimatter observed 
by Dwyer and his team cannot be explained by 
any known processes, they say. “This was so 
strange that we sat on this observation for several 
years,” says Dwyer, who is at the University 
of New Hampshire in Durham. 

The flight took place six years ago, but 
the team is only now reporting the result 
(J. R. Dwyer et al. J. Plasma Phys.; in the press). 
“The observation is a puzzle,” says Michael 
Briggs, a physicist at the NASA Marshall Space 
Flight Center in Huntsville, Alabama, who was 
not involved in the report. 

A key feature of antimatter is that when 
a particle of it makes contact with its 
ordinary-matter counterpart, both are instantly 
transformed into other particles in a process 
known as annihilation. This makes antimatter 
exceedingly rare. However, it has long been 
known that positrons are produced by the 
decay of radioactive atoms and by astrophysical 
phenomena, such as cosmic rays plunging into 
the atmosphere from outer space. In the 
past decade, research by Dwyer and others 
has shown that storms also produce 
positrons, as well as highly energetic 
photons, or γ-rays. 

It was to study such atmospheric γ-rays that 
Dwyer, then at the Florida Institute of Technology 
in Melbourne, fitted a particle detector 
on a Gulfstream V, a type of jet plane typically 
used by business executives. On 21 August 
2009, the pilots turned towards what looked, 
from its radar profile, to be the Georgia coast. 
“Instead, it was a line of thunderstorms — 
and we were flying right through it,” Dwyer 
says. The plane rolled violently back and forth 
and plunged suddenly downwards. “I really 
thought I was going to die.” 

During those frightening minutes, the 
detector picked up three spikes in γ-rays at an 
energy of 511 kiloelectronvolts, the signature 
of a positron annihilating with an electron. 
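
The 511-kiloelectronvolt figure is the rest energy of an electron (or positron), E = mc². When a slow positron annihilates with an electron, the pair's rest energy is typically carried away by two photons of about that energy each. The sketch below reproduces the number from CODATA 2018 constants; it assumes the pair is essentially at rest, and the function name is illustrative.

```python
# CODATA 2018 values in SI units.
ELECTRON_MASS_KG = 9.1093837015e-31
SPEED_OF_LIGHT_M_S = 299_792_458.0
ELEMENTARY_CHARGE_C = 1.602176634e-19  # joules per electronvolt


def rest_energy_kev(mass_kg: float) -> float:
    """Rest energy E = m * c**2, converted from joules to kiloelectronvolts."""
    joules = mass_kg * SPEED_OF_LIGHT_M_S ** 2
    return joules / ELEMENTARY_CHARGE_C / 1e3


electron_kev = rest_energy_kev(ELECTRON_MASS_KG)
print(f"{electron_kev:.1f} keV")  # 511.0 keV
```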

Each γ-ray spike lasted about one-fifth of a 
second, Dwyer and his collaborators say, and 
was accompanied by some γ-rays of slightly 
lower energy. The team concluded that those 
γ-rays had lost energy as a result of travelling 
some distance and calculated that a short-lived 
cloud of positrons, 1–2 kilometres across, had 
surrounded the aircraft. But working out what 
could have produced such a cloud has proved 
hard. “We tried for five years to model the production 
of the positrons,” says Dwyer. 

Electrons discharging from charged clouds 
accelerate to close to the speed of light, and can 
produce highly energetic γ-rays, which in turn 
can generate an electron–positron pair when 
they hit an atomic nucleus. But the team did 
not detect enough γ-rays with sufficient energy 
to do this. 

Another possible explanation is that the 
positrons originated from cosmic rays, parti- 
cles from outer space that collide with atoms in 
the upper atmosphere to produce short-lived 
showers of highly energetic particles, including 
γ-rays. “There’s always like a light drizzle 
of positrons,” says Dwyer. In principle, there 
could be some mechanism that steered the 
positrons towards the plane, he says. But the 
motion of positrons would have created other 
types of radiation, which the team did not see. 

The team’s data are a “cast-iron signature” 
of positrons, says Jasper Kirkby, a particle 
physicist who heads an experiment investigat- 
ing a possible link between cosmic rays and 
cloud formation at the CERN particle-physics 
laboratory near Geneva, Switzerland. But “the 
interpretation needs to be nailed down”. In particular, 
he says, the team’s estimate of the size of 
the positron cloud is not convincing. 

If Kirkby is right, and the cloud was smaller 
than Dwyer’s team estimates, that could imply 
that the positrons were annihilating only in the 
immediate vicinity of the aircraft, or even on 
the craft itself. The wings could have become 
charged, producing extremely intense electric 
fields around them and initiating positron 
production, says Aleksandr Gurevich, an 
atmospheric physicist at the Lebedev Physical 
Institute in Moscow. 

To answer these and other questions, Dwyer 
needs fresh observations of the innards of 
thunderclouds. To that end, he and others are 
sending balloons straight into the most violent 
storms, and the US National Science Founda- 
tion even plans to fly a particle detector on 
an A-10 ‘Warthog’ — an armoured anti-tank 
plane that could withstand the extreme envi- 
ronment. “The insides of thunderstorms are 
like bizarre landscapes that we have barely 
begun to explore,” says Dwyer. ■ 
Microbiome privacy risk 


The DNA of microorganisms living on a person’s body could identify that individual. 


BY EWEN CALLAWAY 


Call it a ‘gut print’. The collective DNA 
of the microbes that colonize a human 
body can uniquely identify someone, 
researchers have found, raising privacy issues. 

The finding¹, published in Proceedings of 
the National Academy of Sciences on 11 May, 
suggests that it might be possible to identify 
a participant in an anonymous study of the 
body’s microbial denizens — its microbiome 
— and to reveal details about that person's 
health, diet or ethnicity. A publicly available 
trove of microbiome DNA maintained by the 
US National Institutes of Health (NIH), mean- 
while, already contains potentially identifiable 
human DNA, according to a study² published 
in Genome Research on 29 April. 

The papers do not name individuals on the 
basis of their microbiomes — and predict that 
it would be difficult to do so currently — but 
they do suggest that those conducting micro- 
biome research should take note. 

“Right now, it’s a little bit of a Wild West as 
far as microbiome data management goes,” 
says Curtis Huttenhower, a computational 
biologist at the Harvard T. H. Chan School of 
Public Health in Boston, Massachusetts, who 
led the latest study¹. “As the field develops, we 
need to make sure there’s a realization that our 
microbiomes are highly unique.” 

Human-genomics researchers have grappled 
with privacy concerns for years. In 2013, scientists 
showed³ that they could name five people 
who had taken part anonymously in the inter- 
national 1,000 Genomes project, by cross-refer- 
encing their DNA with a genealogy database that 
also contained ages, locations and surnames. 

In recent years, the microbiome’s influence 
on our health and behaviour has become a hot 
research topic. Data from human-microbiome 
studies tend to end up in public repositories, 
but it was not clear whether microbiomes were 
permanent enough in individuals to identify 
them over time. 
DNA from bacteria in human faeces could be used 
as a ‘gut print’ to identify individuals. 


Working with publicly available data from 
the NIH Human Microbiome Project (HMP), 
Huttenhower’s team searched samples taken 
from body sites, including the gut, mouth, 
skin and vagina, for combinations of microbial 
genetic markers that were both unique to a per- 
son and stable over time. (Although the HMP 
does not identify individuals by name, it is pos- 
sible to compare a participant's first sample with 
a second one donated weeks or months later.) 

Stool samples offered the best microbiome 
signatures; a person's first sample could be 
linked to their second sample 86% of the time. 
By contrast, skin samples could be accurately 
matched only about one in four times. The 
researchers note that DNA signatures based on 
individual strains of microbes did the best job 
of distinguishing people — much better than 
those based only on microbial species. 
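
The idea of a marker combination that is "both unique to a person and stable over time" can be illustrated with a toy example. The sketch below is not the study's pipeline: the marker names, the three participants and the greedy selection rule are all hypothetical, chosen only to show how a code built from a first sample might, or might not, re-identify the same person in a later sample.

```python
from typing import Dict, List, Set

Samples = Dict[str, Set[str]]  # person -> set of microbial markers detected


def build_code(person: str, first: Samples) -> Set[str]:
    """Greedily pick markers from the person's first sample until no other
    participant's first sample contains all of them (empty set if impossible)."""
    own = first[person]
    rivals = {p for p in first if p != person}  # people not yet ruled out
    code: Set[str] = set()
    while rivals:
        candidates = sorted(own - code)
        if not candidates:
            return set()
        # Prefer the marker shared with the fewest remaining rivals.
        best = min(candidates, key=lambda m: sum(m in first[r] for r in rivals))
        remaining = {r for r in rivals if best in first[r]}
        if len(remaining) == len(rivals):
            return set()  # no marker rules anyone else out
        code.add(best)
        rivals = remaining
    return code


def matches(code: Set[str], second: Samples) -> List[str]:
    """People whose second sample still contains every marker in the code."""
    if not code:
        return []
    return sorted(p for p, markers in second.items() if code <= markers)


def reidentified(first: Samples, second: Samples) -> List[str]:
    """People whose code matches their own second sample and nobody else's."""
    return [p for p in sorted(first) if matches(build_code(p, first), second) == [p]]


# Hypothetical data: C loses its distinguishing marker between visits.
first = {"A": {"m1", "m2", "m3"}, "B": {"m1", "m4"}, "C": {"m2", "m4", "m5"}}
second = {"A": {"m1", "m2", "m3", "m6"}, "B": {"m1", "m4"}, "C": {"m4", "m6"}}
print(reidentified(first, second))  # ['A', 'B']
```

In this toy data two of three participants are re-identified; the unstable marker in C's profile plays the role that less stable body sites, such as skin, play in the reported results.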

Still, Huttenhower concludes that it would be 
“exceptionally challenging to do anything with 
the microbiome data in a single study”. The like- 
liest risk to privacy, he thinks, would come from 
a scenario in which someone had participated 
in two different microbiome studies that each 
contained different pieces of accompanying 
information, such as age and health status. 

But microbiomes could also pose a privacy 
risk because they inevitably get jumbled up 
with human DNA. Although the NIH went 
to considerable lengths to weed human DNA 
out of its HMP database, a team led by compu- 
tational biologist Jonathan Allen of Lawrence 
Livermore National Laboratory in California 
has found² that contamination is still rife. For 
example, the team found sequences known as 
short tandem repeats that tend to vary between 
individuals and are used for making DNA 
matches in forensics. It is not clear whether 
their presence in microbiome samples could 
constitute a precise DNA signature, Allen 
says, but the rise of publicly available DNA 
databases increases the likelihood. Genome 
Research agreed to publish the paper by Allen 
and his team only if the NIH removed the 
known human sequences from its database. 

The odds of identifying someone on the 
basis of their microbiome are low, but researchers 
should take reasonable steps to protect 
privacy, says Yaniv Erlich, a computational 
geneticist at the New York Genome Center 
who led the team that identified³ participants 
in the 1,000 Genomes study. Those who took 
part in the HMP were advised of the risk, says 
Amy McGuire, a bioethicist at Baylor College of 
Medicine in Houston, Texas. “I don’t think there 
should be premature panic over this.” 

An overreaction could slow understanding 
of the microbiome. Laura Rodriguez, director 
of policy at the NIH’s National Human Genome 
Research Institute in Bethesda, Maryland, says 
that as long as protections are in place, such as 
removing as much human DNA from the HMP 
as possible, “we would want to keep it in open 
access because of the value it adds to science”. ■ 
An Ebola survivor is helped out of a clinic in Sierra Leone built by Médecins Sans Frontières. 


PUBLIC HEALTH 


Ebola failures prompt 
WHO rethink 


Health agency’s annual meeting will address shortcomings 
in outbreak response highlighted by West Africa crisis. 


BY ERIKA CHECK HAYDEN 


As West Africa's Ebola epidemic wanes, a 
[…] is building for reform of the World 
Health Organization (WHO). When 
the 194 member states of the United Nations 
body meet on 18-26 May in Geneva, Switzer- 
land, for the 68th World Health Assembly, they 
will consider a number of proposals intended 
to make the WHO more nimble and effective at 
responding to fast-moving public-health crises. 
At stake is the organization’s future as the 
primary responder to global health emer- 
gencies. In September 2014, after the WHO's 
slow response in West Africa, the UN created 
a separate organization, the UN Mission for 
Ebola Emergency Response, to tackle the out- 
break. And last month, UN secretary-general 
Ban Ki-moon appointed a panel on managing 
future health crises that is expected to issue a 
report by the end of this year. Observers note 
that previous WHO failures led to the forma- 
tion of independent initiatives such as GAVI, 
the Vaccine Alliance, which vaccinates children 
in the developing world, and the Global Fund to 
Fight AIDS, Tuberculosis and Malaria. 
“The worry would be that some other group 
will take contingency planning and emergency 
response away from WHO,” says infectious- 
disease specialist Barry Bloom of the Harvard 
T. H. Chan School of Public Health in Boston, 
Massachusetts. “That has been the trend and 
leads back to the fundamental question of what 
is the role of WHO?” 

WHO director-general Margaret Chan has 
acknowledged that the agency made big mis- 
takes in the Ebola epidemic. The outbreak 
revealed “inadequacies and shortcomings in 
this organization's administrative, managerial 
and technical infrastructures,” she said in a 
January speech, which asked for reforms to the 
WHO and for an external review of the agency's 
performance. A draft of the review released on 
8 May concluded that “WHO does not have the 
operational capacity or culture to deliver a full 
emergency public health response.” 

The main reforms up for discussion in 
Geneva include creating a US$100-million 
fund for response to fast-moving events such 
as the Ebola epidemic; setting up an interna- 
tional cadre of first responders to outbreaks; 
and setting guidelines for how aid groups, 
foundations, academic institutions and corpo- 
rations can take part in WHO meetings. Such 
guidelines have been in the works for years as 
global public-health spending has increasingly 
shifted to non-governmental organizations. But 
the importance of private entities was brought 
to the fore by the Ebola outbreak, with groups 
such as Médecins Sans Frontières (also known 
as Doctors Without Borders) proving to be 
more effective than the WHO or governments. 

The agency is also asking member states to 
boost its budget by 8% for 2016-17, after hav- 
ing received flat funding since 2012. And Chan 
wants to strengthen the International Health 
Regulations — rules agreed by member states 
in 2005 that require countries to set up basic 
outbreak-response mechanisms — but there are 
no specific proposals for how this would occur. 

Many reformers attending the meeting are 
worried that even the enormity of the Ebola 
epidemic will not motivate the WHO mem- 
bership to adopt reform, and are pessimistic 
about the effectiveness of such changes if they 
are approved. Some of the same ideas were pro- 
posed — but never adopted — after outbreaks 
of the respiratory syndrome SARS, H5N1 ‘bird 
flu’ and H1N1 ‘swine flu’ in the 2000s.

“I want to emphasize the importance of 
making these changes now, while the epidemic 
is fresh in our mind, and not wait, because the 
political momentum is with us now, and it will 
fade the same way it did with SARS and H1N1,”
says Lawrence Gostin, a health-law and policy 
specialist at Georgetown University in Wash- 
ington DC who serves on a WHO committee 
charged with reforming the organization. 

That sentiment is shared by Adam Kamradt- 
Scott, a health-security specialist at the 
University of Sydney in Australia. But he is 
pessimistic about the probability of change and 
frustrated about poor resources. “I doubt that 
we will see any significant reforms emerge,” he
says. “While governments complain on the one
hand that the organization isn’t effective, they
also don’t give the WHO sufficient resources or
authority to do the job they’ve asked it to do.”

The WHO relies on voluntary contributions 
for more than 70% of its budget; it is unlikely 
that countries would provide for the new task 
force or contingency fund at needed levels, says 
Kamradt-Scott. 

Yet in the wake of the Ebola outbreak, gov- 
ernments and international organizations have 
signalled a willingness to provide funding to 
strengthen preparedness for health crises, 
says Josh Michaud, associate director of global 
health policy at the non-profit Kaiser Fam- 
ily Foundation in Washington DC. He notes 
that the United States last year approved more 
than $5 billion in emergency funding for Ebola 
response and that the World Bank has pledged 
to provide $650 million to help countries to 
rebuild after the epidemic. The United King- 
dom has also said that it would provide $10 mil- 
lion to the WHO’s proposed contingency fund. 

“It’s not easy or cheap to build up these 
capacities,” Michaud says. “But there are signs
that Ebola has sparked real change.”
Genetically modified mustard is being grown in fields in New Delhi.


BIOTECHNOLOGY 


India eases stance 
on GM crop trials 


States begin to permit field tests of transgenic plants. 


BY SANJAY KUMAR 


Five years ago, India was a hostile place for
researchers testing genetically modified
(GM) crops. Its government barred the
commercial planting of a transgenic aubergine
(a vegetable locally known as brinjal) after pro-
tests from anti-GM activists. Then it gave state
governments the power to veto transgenic-
crop field trials. The result: an effective mor-
atorium on such trials. “We felt as if we had
come up against a brick wall, and might as
well chuck it in and do something else,” says
molecular biologist Bharat Char, who works
for Maharashtra Hybrid Seeds Company
(Mahyco), a firm in Jalna that pioneered the
GM brinjal (and in which agricultural giant
Monsanto holds a minority stake).

But under the government of Prime Minister
Narendra Modi, voted into power a year ago,
India has quietly changed course on GM field
testing. In the past year, eight Indian states
largely aligned with Modi’s Bharatiya Janata
Party have approved field trials of GM crops,
between them allowing tests that include
transgenic rice, cotton, maize (corn), mustard, 
brinjal and chickpea, according to documents 
seen by Nature (see ‘GM crop trials’). “There is 
no better feeling than to know that your tech-
nology is performing in the field,” says Char,
who himself planted salt-tolerant GM rice in 
Maharashtra state in January. 


DEVELOPMENT TENSIONS 

The relaxed attitude to GM crop trials is not 
only reviving the enthusiasm of Indian biotech 
researchers — it will also be keenly watched 
around the world, says Dominic Glover, an 
agricultural socio-economist at the University 
of Sussex in Brighton, UK. “India’s attitude 
towards transgenic crops has a symbolic impor- 
tance beyond its borders,” he says, because it 
epitomizes tensions that surround the use of 
GM technology in developing nations.

On the one hand, India must improve its 
agricultural productivity to feed its rapidly 
growing population. The country should 
thus embrace GM efforts to develop higher- 
yield crops that are resistant to pests or grow 
well in droughts or harsh environments such 
as salty soil, says biochemist Govindarajan 
Padmanaban, former director of the Indian 
Institute of Science in Bangalore. 

On the other hand, India has more than 
100 million farmers, who are concerned that if 
GM crops become prevalent, their livelihoods 
and the nation’s food supply will increasingly 
rely on expensive, rapidly changing and pro- 
prietary seed technologies owned by large cor- 
porations, says Glenn Stone, an environmental 
anthropologist at Washington University in St. 
Louis, Missouri. 

These tensions erupted in 2010, when 
farmers and anti-GM groups organized huge 
public protests that led to the brinjal ban (see 
Nature http://doi.org/bkt7dh; 2010). And they 
regularly flare up in criticisms of India’s 2002 
adoption of GM cotton, which contains genes 
to ward off certain insects. This is the coun- 
try’s only permitted commercial GM crop, 
but it is grown in such quantities that India is 
the world’s fourth-biggest GM-crop producer, 
behind the United States, Brazil and Argentina. 


SLOW PROGRESS 

The new lenience on GM field trials has not 
reached all of India: more than 20 states and 
territories are still exercising their vetoes. In 
meetings between March and July last year, 
the Genetic Engineering Appraisal Committee 
(GEAC), part of India’s environment ministry, 
granted permission to 80 field-trial applications, 
but state-government blocks meant that many of 
the trials were never begun. 

These local bottlenecks have mainly hurt 
biotech researchers in India’s universities and 
public-sector institutions, says Padmanaban: 
they retard the progress of domestic tech- 
nology, whereas multinational firms such 
as Monsanto can test GM crops elsewhere. 
Akshay Pradhan, a geneticist working on 
transgenic mustard plants with the Univer- 
sity of Delhi's Centre for Genetic Manipula- 
tion of Crop Plants, says that his team had 
its technology ready as far back as 2002 but 
is only now resuming field trials after a two- 
year hiatus. “This delay has surely thwarted 
its commercialization,” he says. Agricultural
scientists want faster progress: in February, 
some petitioned Modi to lift barriers includ- 
ing the requirement to seek state-government 
approval for field trials after getting the 
thumbs-up from the environment ministry. 

But activists and non-governmental organ- 
izations (NGOs) argue that India should be 
wary of welcoming transgenic crops. They fre- 
quently raise concerns that such crops may be 
unsafe for the environment or human health, 
and that Indian regulators have conflicts of 
interest and have not put in place sufficient
mechanisms to carefully monitor field trials. 
Similar criticisms were raised in 2012 by a 
technical committee convened by India’s 
Supreme Court. The court is still considering 
a moratorium on planting GM crops (includ- 
ing in field trials), which anti-GM activists 
requested in a petition a decade ago. 

One issue that critics and scientists agree on 
is the need for legislation to improve biotech- 
nology regulations. A regulatory bill that failed 
to get through parliament in 2013 is now being 
revised, but could take two years or more to 
be passed, says Sunkeswari Raghavendra Rao, 
an adviser to the government's Department of 
Biotechnology. 


COMMERCIAL CAUTION 

India’s government seems to be treading much 
more cautiously on commercial cultivation 
of transgenic crops than on field trials — 
although farmers in neighbouring Bangladesh 
began cultivating GM brinjal last year. (Some 
policy-makers who do not want to be named 
are concerned that the brinjal, or its seeds, 
will make its way into India anyway, over the 
porous Indo-Bangladesh border.) 

[Figure: GM CROP TRIALS. Transgenic field crop trials have taken place in eight Indian states and territories over the past year. Surviving map labels: Andhra Pradesh; Karnataka; chickpea, cotton; cotton.]

But the government seems reluctant to
engage in transparent debates about the
pros and cons of pushing forward the use of
GM biotechnology in India, as well as about
the details of field trials being allowed in the 
country. Details of GEAC meetings that used 
to be publicly posted on its website now no 
longer appear online, and GEAC officials 
would not talk to Nature for this article. “I find 
this secrecy shocking and absurd,” says Pushpa 
Bhargava, a retired molecular biologist whom
the Supreme Court has nominated to attend 
GEAC meetings. And the government has 
come under fire for freezing the bank accounts 
of the Indian branch of environmental NGO 
Greenpeace. It has cited financial irregularities, 
which Greenpeace denies, but a widely leaked 
intelligence report prepared for Modi last year 
stated that the group’s anti-GM campaigning 
was thwarting India’s development.


CORRECTIONS 

The graphic in the News story ‘Pluto mission 
hunts for hazards’ (Nature 521, 14-15; 
2015) put Nix on the wrong orbital path. The 
correct image can be seen at go.nature.com/ 
sbpxsu. The News story ‘Pint-sized DNA 
sequencer impresses first users’ (Nature 
521, 15-16; 2015) mistakenly stated that 
the MinION device has problems reading
long, repetitive regions of DNA sequence.
It should have said that those difficulties
occur in sections of genome that are rich in 
long stretches of a single DNA base. And the 
reference list in the News story ‘Mysterious 
galactic signal points LHC to dark matter’
(Nature 521, 17-18; 2015) omitted two 
entries. The complete list can be seen at 
go.nature.com/mzopta. 
A special issue explores
the enormous potential 
and major challenges for 
research in south Asia’s 
superpower. 


With 1.3 billion people and a steady growth rate,
India is expected to become the world’s most
populous nation within a generation. Its gross 
domestic product more than tripled between 
2000 and 2013, and its economy ranks third 
in the world in terms of purchasing power, 
behind only China and the United States. 
India’s scientific production has also surged, 
with the number of published papers quadru- 
pling over the same period. 

But the country has far to go before it earns 
the status of a scientific superpower. By almost 
every metric — spending, number of research- 
ers and quality of publications — India under- 
performs relative to developed nations and the 
ascendant economies to which it is most often 
compared, such as China and Brazil. 

This week, Nature takes an unvarnished 
look at the challenges and opportunities for 
scientists in India. An infographic (page 142) 
assesses the country’s strengths and weaknesses 
by comparing its research and development 
landscape with those of comparable countries. 
A News Feature (page 144) probes beneath the 
numbers, examining Indian successes in space, 
biotechnology and energy, as well as exploring 
bureaucracy, underfunding and other obstacles 
to higher education and scientific research. 

Scientists have high hopes that Krishnas- 
wamy VijayRaghavan, the new secretary of the 
Department of Biotechnology, can help to drive 
change in biomedical research. He is profiled on 
page 148. Ten Indian research leaders offer their 
suggestions for how to build their country’s sci- 
entific capacity — from better funding, facilities, 
education and mentoring to fairer recruiting, 
more autonomy and a focus on local problems 
(page 151). Cheap and clean power will be key, 
say energy specialists Arunabha Ghosh and 
Karthik Ganesan (page 156). Only by tackling 
such basic issues can India hope to catch up with 
other rapidly advancing nations.
INDIA


by the numbers 


BY RICHARD VAN NOORDEN 
& 


Highs and lows in 
the country’s 
research landscape. 


Indian science is a study in contrasts. With its vast population and rapidly
expanding economy, the country has ramped up scientific production at an
impressive rate. India started the twenty-first century well behind Russia,
France, Italy and Canada in terms of yearly publications and it now leads
them all by healthy margins. It is quickly closing in on Japan.

Despite those gains, India is not yet a major player in world science. Its
publications generate fewer citations on average than do those of other
science-focused nations, including other emerging countries such as Brazil
and China. Relative to its size, India has very few scientists; many
Indian-born researchers leave for positions abroad and very few foreign
scientists settle in India. The country invests a scant portion of its economy
in research and development (R&D), and it produces relatively few patents
per capita compared with other nations.

But there are bright spots. India boasts several world-class centres for
science education, particularly the highly regarded Indian Institutes of
Technology. Businesses in the country are investing more in R&D, which
bodes well for future innovation. And more women are participating in
science, although their numbers still fall far below those of men.


Elite research centres


India’s 700 or so universities vary tremendously in quality. To identify the leading
science institutions, Nature looked at the citation rates in Elsevier’s Scopus database
for institutes that had produced more than 2,000 papers between 2010 and 2014.


Top 10 institutions


Panjab University, Chandigarh
The country’s top-rated university in last year’s Times Higher Education World University rankings; its research is cited at 1.4 times the world’s average.

Tata Institute of Fundamental Research
Specializes in physics, mathematics and astronomy; around 55% of its publications are internationally …

… than any other Indian research institute or company; it boasts 38 national laboratories and 4,600 active scientists around the country.

… acceptance rates are about 2%, compared with around 6% at Harvard University.
Publications


Since 2000, India has almost 
quadrupled its scholarly 
output, but that rate is 
surpassed by Brazil’s and 
China’s. India underperforms 
relative to its gross domestic 
product (GDP) and 
population. And its scholarly 
impact remains low: in 2013, 
it was nearly 30% below the 
world’s average. 
Workforce


With only 200,000 full-time 
researchers (14% of them 
female) in a population of 
nearly 1.3 billion, India ranks 
below Chile, Kenya, and 
many other countries in 
terms of the density of its 
scientific workforce. 
More women are winning funding from competitive government grant schemes, according to India’s Department of Science and Technology.
INDIA
SCIENCE
TEST


The south Asian superpower has made great strides in research 
and development, but it has a long way to go. 


With her jeans, T-shirt and spirited
attitude, Tapasya Srivastava could 
pass for a student as she works in 
her brightly lit cancer-biology lab 
on the University of Delhi South Campus. Sriv- 
astava, who oversees a team of eight researchers, 
is thrilled that she earned “a small research space 
of my own” in 2010, while still in her thirties. 
“With a decent list of publications under my 
belt, I am one of the few who have studied and
undergone training entirely in India,” she says.

Eight kilometres away, in the chemical- 
engineering department of the Indian Institute 
of Technology Delhi, Shalini Gupta’s team is 
developing sensors to detect early-stage sepsis 
and typhoid. Gupta did her doctorate in the 
United States but returned to India to focus on 
its needs: “I am more connected to society and
its challenges,” she says. 

Srivastava and Gupta are part of a wave of 
young Indian scientists convinced that they can 
do high-quality research at home rather than 
having to move abroad. Such optimism reaches 
all the way to the top: in January, Indian Prime 
Minister Narendra Modi told an assembly of 
scientists to “dream, imagine and explore. You 
will have no better supporter than me.”

India has much to be proud of. Last year, 
it became the first to reach Mars on its initial 
attempt. It boasts a thriving pharmaceutical 
industry that produces low-cost medications 
that are desperately needed by the developing 
world. And in his first year in office, Modi
launched an ambitious plan to make India a 
leader in solar power. 

Such successes cannot hide the huge 
challenges facing this country of 1.3 billion 
people, which leads the world in tuberculosis 
incidence and maternal deaths, and lacks elec- 
tricity for one-quarter of its citizens. India is 
expected to become the world’s most populous 
nation within a generation, and it will require a 
robust science and technology sector to supply 
the needed energy, food, health care, jobs and 
growth. Yet researchers in India and abroad say 
that the country has a relatively weak founda- 
tion in science and engineering. 

Indian research is hampered by stifling 
bureaucracy, poor-quality education at most 
universities and insufficient funding. Succes- 
sive governments have pledged to increase 
support for research and development to 2% 
of India’s gross domestic product (GDP), but 
it has remained static at less than 0.9% of GDP 
since 2005. Despite its huge size, India has a 
relatively tiny number of researchers, and 
many of its budding scientists leave for other 
countries, never to return. Only by tackling
its systemic problems can India compete with 
other emerging powerhouses such as Brazil 
and China. 

“The density of scientists and engineers in 
India is one of the lowest in the world,” says 
Sunil Mani, an economist at the Centre for 
Development Studies in Trivandrum, who is 
assessing Indian science and engineering for 
an upcoming report by the United Nations 
Educational, Scientific and Cultural Organi- 
zation. “There are very many important areas 
where we are not able to do research.”


SPACE TO GROW
In one of the cleanest rooms in India, 
Mylswamy Annadurai is busy conducting 
fitness tests on a 750-kilogram patient — a 
gleaming satellite called ASTROSAT. The 
probe is strapped to a table, where it is being 
shaken at six times the strength of gravity to 
simulate the intense forces of lift-off. ASTRO- 
SAT must also pass tests in extreme high and 
low temperatures and vacuum conditions, 
followed by checks on its solar arrays and 
antennas. If all goes well, the satellite will blast 
into orbit by September, armed with two tele- 
scopes and four other instruments to study 
both nearby and distant stars. 

Annadurai, who is head of the satellite cen-
tre of the Indian Space Research Organisation
(ISRO) in Bangalore, says that ASTROSAT will
be India’s “first full-fledged science mission”
in space. It will carry instruments ten times
heavier than those on India’s first mission to
the Moon, 2008’s Chandrayaan-1, and its 2014
Mars Orbiter Mission, nicknamed Mangalyaan.

India is one of the leading nations in wind power and it has ambitious goals for increasing solar power over the next decade.

With its run of recent accomplishments, 
India has earned international acclaim for its 
ambitious space programme, which includes 
launch vehicles, communication satellites 
and one of the world’s largest constellations
of remote-sensing satellites, as well as its sci-
ence missions. Since ISRO was founded in
1969, the government has invested heavily in 
it, and even established a dedicated university 
in 2007 to train personnel. “The ISRO techni- 
cal test, assembly and launch facilities are first 
class,” says Paul Spudis, senior staff scientist at 
the Lunar and Planetary Institute in Houston, 
Texas, who was the principal investigator for 
one of Chandrayaan-1’s experiments.

Chandrayaan-1 carried an orbiter and 
a 35-kilogram probe that took images as it 
smashed into the Moon at high speed. ISRO 
plans to follow it in 2017 with Chandrayaan-2, 
which will gently set down a lander and a six-
wheeled rover; together with an orbiter, they
will study the composition of the Moon's sur- 
face. Up next after that is the Aditya mission to 
study the Sun’s corona, in 2018. 

Spudis is critical of last year’s Mars mission, 
calling it “largely irrelevant” and saying that it 
would have been better to return quickly to the
Moon. ISRO, he says, “seems to lack a strategic
vision of what it wants to accomplish in space”. 
But the agency counters that it is pursuing sev- 
eral missions in parallel; the Mars mission just 
proceeded faster than Chandrayaan-2. 

And the success in reaching Mars has 
convinced others at ISRO that they can carry 
out world-class space-science missions, says 
Annadurai. “The Mars mission experience has 
once again strengthened our belief that we can.”


BIOTECH BONANZA 
In Genome Valley, a biotechnology park in 
Hyderabad, entrepreneur Krishna Ella is con- 
founding expectations. Ella returned home 
from the United States in 1996 with a 12-metre 
shipping container filled with vaccine-making 
equipment to support his grand plan of pro- 
ducing a US$1 vaccine for hepatitis B. That 
goal, which would make his vaccine at least 
an order of magnitude cheaper than the avail- 
able one, struck investors as crazy, he says. But 
within three years, Ella's company Bharat Bio- 
tech International Limited (BBIL) succeeded 
in producing the Revac-B+ hepatitis vaccine at 
$3 a dose, which has since dropped to 30 cents 
per dose. It went on to develop vaccines against 
Japanese encephalitis, rabies, Haemophilus
influenzae type b and, most recently, rotavirus.
Each costs barely a dollar per dose. 
Affordable medicines are the corner- 
stone of India’s health-care sector, where 
publicly funded hospitals struggle to provide
treatment. The country has long battled infec- 
tious diseases such as tuberculosis, malaria 
and dengue, but is now facing rising numbers 
of non-communicable illnesses, including 
diabetes and coronary heart disease. A 2014 
report from the World Economic Forum and 
Harvard School of Public Health estimates 
that non-communicable diseases and mental 
illness could cost India $4.58 trillion by 2030. 

Low-price vaccines and generic drugs have 
helped India to carve out a niche in the inter- 
national pharmaceutical industry. The global 
medical charity Médecins Sans Frontières (also
known as Doctors Without Borders), which 
relies on Indian generics for 80% of its anti- 
HIV drugs, hails the country as the “pharmacy 
of the developing world”. Other international 
organizations, including the UN children’s 
charity UNICEF and the Global Fund to Fight 
AIDS, Tuberculosis and Malaria, routinely
use Indian vaccines and generic drugs to treat 
infectious diseases (see Nature 468, 143; 2010). 

But India is battling criticism over the 
quality of some of its pharmaceuticals. In 2012, 
for example, the World Health Organization 
took BBIL’s hepatitis B vaccine and oral polio
vaccine off the list of drugs preapproved for use 
by the UN. Ella says that the issues related to 
documentation submission and that they have 
since been sorted out. The vaccines are now 
back on the list. 
In 2014, the US Food and Drug Adminis-
tration (FDA) sent warning letters to seven
Indian firms over various concerns relating 
to pharmaceutical production there. An FDA 
spokesperson told Nature: “While some Indian 
companies meet US product quality standards, 
others have been found to lack sufficient con- 
trols and systems to assure drug quality, both of 
finished product and active ingredients.” The 
FDA has an India office to work closely with 
Indian drug regulators to solve those problems. 

And some in the biotech sector warn that 
India has a long way to go to create a thriv- 
ing enterprise in developing new drugs. The 
country’s success in the generics industry relies 
on a different set of skills: reverse-engineering
pharmaceuticals created elsewhere by break- 
ing them into their components and remaking 
them through cheaper routes. 

“The challenge for the sector will be to 
graduate from reverse engineering to new- 
drug discovery,” says Pallu Reddanna, a bio- 
technologist at the University of Hyderabad. 
“There is need for incentives and promotion 
of academy-industry interactions.” 

The government and private sector are 
trying to jump-start such efforts by setting 
up incubators that help transfer university 
and lab know-how to industry, and provide 
infrastructure and financial support to start-
ups. Such incubators are the “greatest changer
in the drug-discovery sector in India”, says
P. Yogeeswari, a chemist at the Hyderabad
campus of Birla Institute of Technology and
Science.

Krishnaswamy VijayRaghavan, secretary of 
the government’s Department of Biotechnol- 
ogy, commends “incredible growth” in India's 
biotech entrepreneurship — despite the lack 
of big drug companies and the relatively low 
domestic investment in drug discovery. Inter- 
national and industry collaborations with aca- 
demia are helping to advance the sector, he says 
(see page 148). 

In 2013, the department started two major 
projects seeking drugs for drug-resistant tuber- 
culosis and chronic disorders such as heart dis- 
ease. In early leads, scientists have zeroed in 
on some human proteins that are crucial for 
the survival of multidrug-resistant tubercu- 
losis strains. Proof-of-concept studies in mice 
have demonstrated that targeting such host 
proteins could help to kill the drug-resistant 
strains, says VijayRaghavan (S. Jayaswal et al. 
PLoS Pathog. 6, e1000839; 2010). “We are at an
exciting early applied stage,” he says. 


POWER HUNGRY 
Nearly 2,000 kilometres north of Genome 
Valley, 9.7 hectares of solar panels cover a 
building in Punjab state, generating 7.5 mega- 
watts of electricity. This project is India’s largest 
roof-top solar installation that is connected to 
an electrical power grid, and it signals India’s 
outsize ambitions in renewable energy. 

146 | NATURE | VOL 521 | 14 MAY 2015

Coal supplies two-thirds of the electricity
in India and will remain king for some time.
But the government has set aggressive goals 
for installing solar-energy capacity. In 2014, 
Modi’s government announced that it would 
develop 100 gigawatts of solar-energy capacity 
by 2022. This is a huge leap from the existing 
3.7 gigawatts of solar capacity — just 1.4% of 
India’s total electricity generation today. 




“India is one of the most attractive markets 
in the world,” says Pashupathy Gopalan, Asia
Pacific head of SunEdison, a global solar- 
energy company based in Maryland Heights, 
Missouri, which is joining Adani Enterprises 
of Ahmedabad to build India’s largest solar- 
panel-manufacturing facility. “We are entering 
a new era where solar electricity is competi- 
tive and has achieved ‘socket parity’ with other 
sources of energy in India.” 

There are other big international 
collaborations. The Solar Energy Research 
Institute for India and the United States 
was established in 2012 to target emerging 
research areas. In one project, researchers 
are trying to generate solar thermal power 
by using sunlight to heat up a highly com- 
pressed fluid form of carbon dioxide so that 
it turns electricity-generating turbines. This 
could be used in much smaller plants than 
conventional steam-driven turbines. 

But some analysts say that India suffers from 
“gigawatt obsession”. The focus on giant solar 
plants comes at the expense of smaller facili- 
ties that do not require large parcels of land, 
but could provide electricity to isolated towns, 
even without being connected to the grid. 

“The gigawatt rush must pay attention to 
the pace with which the capacity is to be built 
in India,” says Satish Agnihotri, former sec- 
retary of India’s Ministry of New and Renew- 
able Energy. Plans to build large plants could 
run into opposition in densely populated or 
heavily farmed areas, and in remote areas it 
can be difficult to hook gigawatt projects up 
to the electrical system. 

News and debates about the government's 
current focus on solar power have over- 
shadowed past successes in wind energy. 
India has more than 23 gigawatts of installed 
wind-power capacity, which puts it roughly 
even with Spain as the world’s fourth biggest 
producer. And Mumbai-based Suzlon is the 
world’s seventh-largest turbine manufacturer. 

India has been able to develop its 
wind power in part because of long-term 
government policies and financial incentives,
as well as a growing interest from independent 
power producers and financiers, says Shantanu 
Jaiswal, lead analyst at Bloomberg New Energy 
Finance in New Delhi. But some of the con- 
cerns about solar power also hamper wind 
projects, which face difficulty acquiring land, 
encounter lengthy permitting processes and 
often have trouble connecting to the electrical 
power grid. 


EDUCATION OUTLOOK 

Back on her leafy campus in Delhi, Srivastava 
and her fellow young faculty members are 
less concerned about big national projects 
than about producing their own high-quality 
research. They are lucky, they acknowledge, to 
work in one of India’s top federally funded uni- 
versities, which has superior faculty members 
and equipment. 

Others are not so fortunate. India has some 
700 universities of varying quality, from the 
elite institutions funded by the central gov- 
ernment to more than 300 state universities 
and about 200 private ones. “The landscape of 
science education is uneven,” says Sri Krishna 
Joshi, former director-general of India’s Coun- 
cil of Scientific and Industrial Research (CSIR) 
and former chair of the advisory committee 
of the University Grants Commission, which 
funds and oversees university education in 
India. 

In the top institutions, he says, “science 
students are doing world-class research, pub- 
lishing in leading journals and boosting the 
global reputation of our country”. National 
scientific research institutes and leading uni- 
versities have all contributed to India’s grow- 
ing strengths in research: the country’s output 
of scientific publications quadrupled between 
2000 and 2013. 

Even so, India is not keeping pace with some 
other emerging nations, which have increased 
their scientific output more quickly (see 
page 142). And the advances in India’s global 
science metrics mask some signs of declining 
quality in university science education, espe- 
cially at the cash-starved universities funded by 
state governments that account for the major- 
ity of India’s science undergraduates, says Joshi. 
Publicly supported universities depend on the 
Ministry of Human Resource Development for 
funds, and the higher-education budget was hit 
by a 3% cut in the 2014-15 budget cycle. 

“Lack of even bare, minimal and sustainable 
funds for teaching, let alone research, has seri- 
ously plagued the quality and standards of sci- 
ence education,” says Krishna Ganesh, a chemist 
and director of the Indian Institute of Science 
Education and Research in Pune, one of five top 
universities set up in India since 2006. 

Many students at state universities are 
receiving a substandard education, says Joshi. 
“Here, there are no good science teachers, no 
good Indian textbooks, and most of the science 
laboratories are poorly equipped,” he says. 
Graduates celebrate at the University of Delhi, a top institution. The majority of science students in India graduate from lower-quality universities that lack funding.


“We are caught in a vicious circle of 
mediocrity,” says geneticist Deepak Pental, for- 
mer vice-chancellor of the University of Delhi. 

Most analysts are concerned over the plight 
of science departments in state universities. At 
the University of Calcutta, for example, even 
procuring a laptop involves endless red tape, 
says physicist Amitava Raychaudhuri. At some 
other institutions, support from funding agen- 
cies helps to purchase equipment, but there is a 
shortage of qualified faculty members to train 
the students. 

Beyond questions of quality, the quantity 
of available university spots is a persistent 
problem. India has gone through a univer- 
sity building boom, but there still is a huge 
shortage of slots for students (see Nature 472, 
24-26; 2011). 

“There is a rise in the number of students 
going for higher education in India, which 
reflects the rising aspirations of its society. 
But this rise should be matched by better 
infrastructure and financial support,” says 
Joshi. 


RESEARCH INVESTMENTS 
Investments in science have also dragged. 
India’s research intensity — the share of its 
gross domestic product devoted to research and 
development (GERD) — remains lower than 
those of many other nations, including Brazil 
and Russia. Twenty years ago, India’s GERD
exceeded China’s. Now, it is less than half of China’s.

But those numbers do not tell the whole 
story, says Ashutosh Sharma, secretary of
the government’s Department of Science and
Technology — one of India’s largest research- 
funding agencies. “The total funding is, 
perhaps, not as poor as it seems in terms of 
absolute numbers, because the number of full-time
scientists doing research is also low.”

India averages about 4 full-time researchers 
per 10,000 people in the labour force, whereas 
China boasts 18 and nations with advanced 
science and technology sectors have around 
80. “India spends about $150,000 per scientist 
per year, which is probably not too far from the 
optimal levels,” says Sharma.

India’s notorious bureaucracy deserves part 
of the blame for the problems afflicting science 
education and research. The administrators of 
several state universities are political appoin- 
tees rather than leading academics. “Often 
the appointed person has never been exposed 
to a good university in India or abroad,” says
Kizhakeyil Sebastian, chair of the science-edu- 
cation panel of the Indian Academy of Sciences 
in Bangalore. 

“There is over-bureaucratization within 
the universities and their controlling bodies,” 
says Pental. It often takes two years to recruit 
an academic after announcing an open post, 
which means that the best applicants can slip 
away, says Raychaudhuri. 

The governmental quagmire has begun to 
affect some elite national research institutes, 
too. Of the 38 national laboratories that are part 
of the CSIR, only 25 have full-time directors. 
The rest are making do with acting directors, 
or temporary arrangements. 
Even the CSIR headquarters in New Delhi
has been without a full-time leader since Janu- 
ary 2014. Interim director-general Madhukar 
Garg says that “the current situation is indeed 
challenging. CSIR is the backbone of scien- 
tific and technological research in the country. 
In case the prevailing scenario continues, it 
will affect the national innovation system as 
a whole.” 

Sharma acknowledges that red tape is
“all-pervasive”, but he says that the challenges are
not bogging down Indian science. “In terms of 
output indicators such as the number of papers 
per dollar spent, Indian science is among the 
very top performers in the world,” he says.

And there are some signs that India might be 
slowing its debilitating brain drain. Although 
the vast majority of Indians who obtain science 
doctorates in the United States remain there 
for at least 5 years after graduation, the propor- 
tion has declined: from 89% in 2001 to 82% in 
2011, the most recent year for which data are 
available. 

Kaustuv Datta, a geneticist at Delhi 
University South Campus, is one of those who 
returned. Datta may “hate the red-tapism” at 
universities in India, but he still prefers doing 
research back home. “My parents are here, in 
India. And academics have a strong, positive 
influence on the next generation of students,” 
says Datta. “I wanted to make that contribution 
in India.”


T. V. Padma is a science journalist in New 
Delhi. 
ANTI-BUREAUCRAT

On 12 April, Krishnaswamy VijayRaghavan
posted an update to his more than 2,500
Facebook friends. It announced a bold
plan from India’s Department of Biotechnology
(DBT) — the agency that VijayRaghavan leads,
and the country’s largest funder of biomedical
research — to establish a new marine-biology
institute and research stations along India’s vast
coastline. Within hours, 500 people had ‘liked’
the post and more than 60 had left comments
of congratulations.

Only one offered a critical note. A graduate
student said that starting programmes is all well
and good, but the DBT must hold the researchers
whom it already funds accountable for the
quality of their science. Shortly after,
VijayRaghavan replied: “Your words are very wise
and correct! Thank you. We must keep your
points in mind if we are to get maximum for
our Rupee and have quality science.”

© 2015 Macmillan Publishers Limited. All rights reserved

It is rare for a public official to be so 
responsive and open to criticism, especially 
in a country as steeped in bureaucratic 
hierarchy as India, says biologist Inder Verma 
at the Salk Institute for Biological Studies in La
Jolla, California, who has served as a scientific 
adviser to the Indian government since 
the 1980s. Yet almost anyone who contacts 
VijayRaghavan by Facebook, Twitter or e-mail 
gets a personal response in minutes. “Vijay is a 
breath of fresh air,” Verma says.

VijayRaghavan is more than that. He is a 
respected fly geneticist and administrator 
who helped to build the National Centre for 
Biological Sciences (NCBS) in Bangalore, one
of India’s most prestigious institutions, from 
the ground up. In January 2013, he left his job 
as NCBS director and moved to New Delhi to 
lead the DBT. He says that he wants to inject 
rigour into Indian science and train scientists 
to work together on tractable problems. As 
grand visions go, his can seem muted, almost 
modest. “I’m not going to be stupid and try 
something completely nutty; I’m going to try 
something within my grasp,” he says. 

Researchers are optimistic about what he 
might be able to achieve. “It’s very rare to have a
scientist of Vijay’s calibre heading a government
department,” says Jyotsna Dhawan, a stem-cell
biologist who worked with VijayRaghavan for
seven years. “So I think all of us in the scientific
community have very high hopes.” 

But they also recognize the challenges, 
which include wrangling with New Delhi’s
murky politics — known for ensnaring plans 
in red tape — and the DBT’s long, painful 
grant-review process. In the past couple of 
years, the Ministry of Finance has made it 
difficult for the agency to honour even 
approved grants. And although the DBT is 
a major funder of extramural research, the 
money that it actually gets each year — a 
little more than 14 billion rupees (about 
US$225 million) — is a fraction of that 
commanded by analogous agencies elsewhere, 
such as the US National Institutes of Health. 

Given the challenges, even the most ardent 
well-wishers are holding their applause. “It’s 
not entirely apparent to me what an individual, 
even one so dynamic and forward-looking as 
VijayRaghavan, can do to cut through the red 
tape,” says Dhawan. 


A PASSION TO LEARN 

A self-described “air-force brat”, Vijay- 
Raghavan grew up all over India, moving every 
few years. He was hungry for knowledge, and, 
as a teenager, used to cycle to his local branch 
of the British Council or the US Information 
Service — the main sources of foreign publi- 
cations in those days — and read every book 
and magazine that he could find. “In the pre- 
Internet days, that was my food,” he says. 

After studying chemical engineering at 
the Indian Institute of Technology Kanpur, 
VijayRaghavan was preparing to leave for a 
bioengineering PhD programme in 
Switzerland when he chanced on an article 
by renowned molecular biologist Obaid 
Siddiqi on using genetics to understand the 
nervous system. It was a departure from the 
work that VijayRaghavan had originally 
planned to do, but, he says, “I found the 
formalism of genetics easy to grasp, and that 
excited me very much.” 

He sought out Siddiqi at the Tata Institute 
of Fundamental Research (TIFR) in Mumbai, 
where he began a PhD programme. It was, at 
the time, a place that afforded considerable 
freedom to its students. “You did what you
pleased and you joined whomever you wanted
to for your research,” VijayRaghavan says. “It
was an exhilarating experience.” 

But there was a growing air of complacency 
and nepotism at the TIFR that frustrated 
Siddiqi. For years, he had been planning to 
build a new institute, and he saw a natural ally 
in VijayRaghavan. The pair began to hatch 
plans, even as VijayRaghavan embarked on 
further training at the Medical Research
Council Laboratory of Molecular Biology in
Cambridge, UK, and, later, undertook a
postdoctoral fellowship at the California Institute
of Technology (Caltech) in Pasadena.

Elliot Meyerowitz, VijayRaghavan’s adviser
at Caltech, says that lab members routinely
tried to flummox foreign postdocs with US
slang and customs, but they could never rattle
VijayRaghavan. “I don’t know whether he
understood, or if he was just so cool, we didn’t
know he didn’t understand,” Meyerowitz says.
VijayRaghavan says that he did understand,
thanks to his time devouring British and
US magazines.


FROM THE GROUND UP

In 1988, VijayRaghavan left Caltech and
returned to India to head a lab at the TIFR,
and he, Siddiqi and a handful of other scientists
laid groundwork for the research centre
in Bangalore. They wanted the institute,
which would be named the NCBS, to be
different from any in India before it. Siddiqi
became founding director, but VijayRaghavan
and a few others were closely involved in its
development. “We were in the trenches
together — young, some very talented, all very
driven,” VijayRaghavan says. “We had a sense
of rebellion.”

From the start, VijayRaghavan wanted to
recruit people trained in multiple disciplines
who were focused on cutting-edge techniques,
such as single-cell analysis, says statistician
Partha Majumder. “This trait of being able to
look way into the future is what sets him apart.”

In 1991, VijayRaghavan moved to Bangalore
to launch the NCBS’s first lab. Over the next
year, two more faculty members joined him.
The entire centre was a “shack”, he recalls,
situated on one floor of the radio-astronomy
building at the Indian Institute of Science.
VijayRaghavan had to cycle 1.5 kilometres
to the nearest biology lab to photograph his
DNA gels. “We had an absolute ball of a time,”
he says.

Along with building the institute,
VijayRaghavan was strengthening his scientific
reputation. He borrowed equipment to set up
a series of elegant genetic experiments that
would enable him to write several high-profile
papers defining specific events in Drosophila
muscle development.

Other faculty members, such as Gaiti
Hasan, M. K. Mathew and Jayant Udgaonkar,
were publishing groundbreaking papers in
cell signalling and protein folding, which in
turn helped to entice other scientists to join
the NCBS. “We made extraordinarily rash
promises that we would do everything for
them, which we did,” VijayRaghavan says.

In 1993, for example, VijayRaghavan 
learned that Satyajit Mayor, a cell biologist in 
New York City whom he was trying to recruit, 
needed a pricey Zeiss inverted microscope. 
VijayRaghavan had been promised some 
equipment for his own lab through a grant 
from the Rockefeller Foundation in New 
York City, but stringent rules from the TIFR 
and the Indian government had held up the 
procurement for years. He changed his request 
to get the microscope instead. 

All that Mayor knows of the negotiation 
is that he sent an e-mail to VijayRaghavan 
one night telling him that he would not be 
able to join the institute without that type of 
microscope. He woke up the next morning 
to VijayRaghavan’s reply: “It’ll be here when
you arrive.” Mayor joined the NCBS about 
18 months later. 


UNITED BY SCIENCE 

VijayRaghavan took over from Siddiqi as 
director of the NCBS in 1996. As the institute 
grew, he strove to build a democratic system, 
in which even graduate students had a say, and 
criticism was not just accepted, but expected. 
Before moving ahead with any new plans, he 
always made sure that he “brought people
along with him”, says Mayor, now the institute’s
director.

In 1999, for example, the leaders of the 
institute were considering adding a master’s 
programme in wildlife ecology and conserva- 
tion. At first, only 3 or 4 of its then 22 faculty 
members were in favour of the idea. At a 
meeting, VijayRaghavan carefully listened to 
the pros and cons, and was ultimately able to 
convince everyone, recalls Mayor. “Everybody 
left that meeting feeling like we’d done the
right thing,” he says. “The way the discussion 
and the dialogue and the arguments were put 
across, quite masterfully, was so Vijay.” 

The programme has become one of the 
institute's most successful, with eight field 
stations across the country and faculty 
members drawn from the United States and 
Germany. By the time VijayRaghavan left, the 
NCBS was widely regarded as one of India’s
leading research organizations.

K. VijayRaghavan (bottom right) in a class photo at the Tata Institute of Fundamental Research in Mumbai in 1980. Obaid Siddiqi sits top left.

But the NCBS, and a few other select 
institutes, are exceptions in India. Much of 
the country’s science is beset with the same 
problems it has had for decades — intermi- 
nable waits for reagents, or a granting scheme 
that places a 3-year limit on funding, forc- 
ing researchers to write new applications in 
unrealistic cycles. 

VijayRaghavan’s predecessor at the DBT,
Maharaj Kishan Bhan, had done much to 
modernize Indian research and make it more 
independent, including helping to develop a 
low-cost rotavirus vaccine and setting up an 
organization to support entrepreneurs. “If 
Bhan was not able to succeed in some places,” 
says Verma, it was because of limited resources 
and not having enough people to support his 
vision. “Perhaps he bit off more than he could 
chew — the same could happen to Vijay.” 


BACK TO BASICS 

Since his arrival at the DBT, VijayRaghavan 
has unveiled a few plans. Financially, Indian 
science is no match for that in the United 
States, Europe or China, something that 
he freely acknowledges. But he says that India 
can make big gains by capitalizing on its 
advantages and by collaborating with others. 

His main priorities are to invest in basic 
research areas — such as computational 
biology, in which India is already strong — to 
break down the barriers between disciplines 
and to improve training for all scientists. 

“I think he’s got a very, very strong vision 
about the importance of fundamental and basic 
science,” says Eve Marder, a neuroscientist at
Brandeis University in Waltham, Massachusetts,
who serves alongside VijayRaghavan on the
scientific advisory board for the Janelia
Research Campus in Ashburn, Virginia.

The DBT’s marine-biology initiative 
exemplifies this vision. The effort, intended 
to chart biodiversity and identify compounds 
for biotechnology development, is the 
DBT’s — and VijayRaghavan’s — brainchild. 
But it involves the Indian Space Research 
Organisation in Bangalore and the Ministry of 
Earth Sciences in New Delhi, both first-time 
partners for the DBT. In addition, the French 
National Centre for Scientific Research and 
the Pierre and Marie Curie University in Paris 
will help to train Indian researchers. The 
project is part of VijayRaghavan’s strategy 
to compensate for the DBT’s limited budget 
by partnering with every ministry that has 
funds allotted for science and technology, 
including the sanitation, maternal-health and 
nutrition ministries. 

In the next year, he plans to roll out an Indian 
body modelled after the European Molecular 
Biology Organization — a scientific society that 
would promote India as a hub for international 
collaborations and offer online training for 
scientists at all levels. He is also encouraging 
local collaboration on training and research. 
In the Delhi area, for example, he plans to 
persuade the leaders of well-established 
immunology, mathematics, engineering and 
medical institutes to work together. “Boom!” he 
says. “Within a few years, you're going to have 
extraordinary-quality people being trained, 
both engineers and clinicians.”

In the short term, he wants to play to India’s 
strengths. Getting India’s thriving community 
of mathematicians and computer scientists
to work on problems in biology, for example, 
could help the country to gain an edge in 
bioinformatics and quantitative biology — 
fields that do not typically require as much 
funding as bench biology. 

This is all easier said than done, Vijay- 
Raghavan admits, but he intends to use 
financial incentives and disincentives — what 
he calls “fire in the belly” and “fire in the rear” 
— to make it happen. 

In the past, the DBT has set out strict 
budgetary allocations at the beginning of each 
year, and had little flexibility in later months. 
But last year, VijayRaghavan set aside a pot 
of about $33 million from the DBT’s annual 
budget. The money can be used to respond to 
innovative ideas, reviewed by international 
experts. This is sure to aggravate some insti- 
tutes that are used to being well-funded, but 
VijayRaghavan is unmoved. “It’s about time 
that we recognize excellence and recognize 
shoddiness,” he says. 

VijayRaghavan is also tackling the grant- 
review process. He is streamlining the DBT’s 
online application system, he has created time- 
lines for submitting grants and has introduced 
the DBT’s first open-access and conflict-of- 
interest policies. 

“When people complain about problems in 
India, it’s rather astounding how few of us are 
actually doing something about them,” he says.
“If you actually start doing something about 
anything, the situation changes.” 

But some colleagues have concerns. Mayor 
says that VijayRaghavan’s desire for consensus 
and harmony could prove a weakness. “He is so 
keen to be extremely positive about everyone,”
Mayor says. “When you're operating in the real 
world where you have to get things done, that, 
I would say, is a bit of a problem.”

And in a government department, 
VijayRaghavan will not be able to hand-pick 
people who share his vision, as he did at the 
NCBS. “I’m a little concerned that if he doesn’t
have that, he will burn out, because he will try
to do it all himself,” Mayor says.

VijayRaghavan shows no sign of burnout 
yet. He still maintains a lab, albeit a lean one, 
conferring with his team in the evenings 
through Skype and returning to Bangalore 
every weekend. Besides the tweets and 
Facebook posts, he blogs and makes time to 
run several times a week — and, he says, he is 
having fun doing it all. 

“I have to tell you one simple rule in any job.
If you wake up for several days in a row and 
say, ‘Why am I doing this?’ then you're better 
off quitting,” he says. “Not only has that not 
happened, I’m actually quite excited when I 
wake up every day. I just look at the day and 
hit it hard.”


Apoorva Mandavilli is a freelance science 
journalist based in New York and editor-in-chief
of the autism news site SFARI.org.
Priorities for science in India


Ten Indian research leaders give their prescriptions, from better funding, facilities, 
mentoring and education to greater respect, fairness, autonomy and confidence. 


Share data on
water resources


Professor of civil engineering, Indian 
Institute of Science, Bangalore 


India is facing an imminent water crisis. 
Almost 100 million people have no access to 
safe drinking water, and most others experi- 
ence regular shortages. More than one-third 
of the roughly 400 rivers that are monitored 
by the government are polluted. 
Groundwater levels are alarmingly low in 
many areas, owing to overexploitation for 
irrigation and domestic supply. An estimated
60% of groundwater sources will be degraded
in two decades. Cities consume vast amounts 
of energy to pump water over long distances 
from rivers and reservoirs, and unplanned 
urban growth is blocking drainage channels, 
causing flooding. Climate change will make 
matters worse. Water availability, demand and 
quality, as well as floods, droughts and salinity 
intrusion, will be affected. 

But across India, hydrological research is 
hindered by a lack of access to good-quality 
data. The government bodies that are custodi- 
ans of hydrological, meteorological, environ- 
mental and agricultural data are reluctant to 
share information openly. Combined with
bureaucratic hurdles, this means that Indian 
researchers must either use poor-quality data 
or turn to US or European records. 

To strengthen hydrological research 
and promote scientific decisions on water 
policy, the government must upgrade its 
data-collection, monitoring, communica- 
tion and storage networks, in terms of both 
technology and density. The government's 
Water Resources Information System is 
an excellent start. Now it needs to provide 
real-time data on stream flow, soil moisture, 
groundwater levels and evapotranspiration. 

‘Critical zone observatories’ that measure 
atmospheric, hydrological, biogeochemi- 
cal, ecological and other fluxes in Earth’s 
near-surface zone should be set up in each of 
India’s seven hydro-climatic regions and 
integrated with others globally. Observatories
should span different types of climate,
terrain, demography, land use and land cover. 
India needs multidisciplinary centres of 
excellence to address big questions — on 
water-system response rates to climate 
change, coupled forecasting of intense 
precipitation and floods, medium-range 
weather forecasts for agricultural water man- 
agement and water contamination. These 
centres would also train the next generation 
of researchers to use holistic approaches. The 
Indian Institute of Science, Bangalore, has 
established such a centre this year. This step 
should be emulated nationwide with fund- 
ing from government and private industry. 


HIRIYAKKANAVAR ILA 
Support the bulk 
of students 


Professor of chemistry, Jawaharlal 
Nehru Centre for Advanced Scientific 
Research, Bangalore 


India’s university system is broken. The 
higher-education system was started by the 
British in 1857 with the establishment of three 
universities — Calcutta, Bombay and Madras 
— and 28 affiliated colleges. After independence
in 1947 and the creation of the University
Grants Commission (UGC) in 1956, the
number of universities grew exponentially. 
Today, there are more than 600, including 
about 200 private ones. Of the public universities,
46 are funded by central government, the
rest by state governments. A few, such as the
central University of Hyderabad, do world- 
class research. 

In addition to these, to improve training 
in basic sciences and technology, the 
government established 16 Indian Insti- 
tutes of Technology and 5 Indian Institutes 
of Science Education and Research. There 
are also about 40 Council of Scientific and 
Industrial Research laboratories engaged in 
applied research, along with a few premier
research institutes, including my own. 

The handful of these that compete at the 
international level contribute the bulk of 
high-quality scientific research in India. But 
they educate a tiny fraction of our students. 

Facilities and teaching at the universities 
that serve more than 29 million students are 
alarming. Most are ‘chalk and talk’ classrooms
with poor-quality teaching laboratories, let 
alone research laboratories. Faculty appoint- 
ments are often made on the basis of political 
connections, caste or bribes, and funds are 
misappropriated. Inbreeding results: many 
highly qualified young scientists refuse to 
take up faculty positions in these universi- 
ties because of the lack of infrastructure, the 
hostile environment and bureaucracy.

This is a disturbing situation. India needs 
trained, innovative minds to meet its for- 
midable challenges. The state and central 
governments should take urgent action. 

The government should appoint highly 
qualified, broad-minded vice-chancellors, 
who will recruit qualified faculty members 
and give them state-of-the-art research 
facilities with no external interference. Fac- 
ulty should then focus on basic research and 
quality teaching, and encourage regional 
and international collaboration networks to 
strengthen scientific research. The govern- 
ment should also create many specialized 
research centres in the universities (like the 
CNRS in France). Fixing our university sys- 
tem will require a complete overhaul of the 
UGC, changes in institutional policy and 
legislation. This will be difficult with the pre- 
sent disconnect between science and policy 
in a government that has cut research budg- 
ets, focused on manufacturing and dissolved 
its scientific advisory committee. 


YAMUNA KRISHNAN 
Crack the cliques, 
enable visionaries 


Professor of chemistry, University of 
Chicago, Illinois 


To catapult India into the top five scientific 
nations, the country needs enabling policies 
that money can't buy. India has huge posi- 
tives but it is hamstrung by socio-cultural 
issues, two of which I address here: a herd 
mentality and a paucity of early-stage mentorship.
My ideas stem from my 15 years as a
graduate student and young research-group 
leader in India. 

Having recently moved from the National 
Centre for Biological Sciences in Bangalore 
to Chicago, Illinois, I have noticed a funda- 
mental difference in the attitude of young 
US scientists from that of their Indian coun- 
terparts: their appetite for big problems. 

‘Going for great’ is a skill acquired very
early on in the West. Senior researchers
spot gifted graduate students, connect
them with the best scientific mentors,
nurture them and ensure their visibility
over decades.

In India, researchers generally start being 
mentored only when they show promise as 
young principal investigators. Thus a fresh 
returnee from a leading postdoctoral lab 
abroad suddenly becomes essentially invis- 
ible to key collaborators or contacts at home 
and elsewhere. This results in the returnee 
pursuing quality problems fragmented into
smaller stories for more publications, but of 
lower visibility. The strategy is to edge slowly 
towards the big ideas. Often, these big ideas 
are either suddenly solved by counterparts in 
the West, or become outdated. A top-down, 
merit-based, long-term mentorship scheme 
— starting at the graduate-student stage — 
could prove transformative. 

Cultivating excellence is a selective process 
that can be perceived as elitist. But India is try- 
ing its best to become an egalitarian society. 
It has some outstanding senior scientists — 
visionaries who care deeply about taking their 
nation from good to great. But their efforts are 
neutralized by a pedestrian majority intent on
preserving the status quo. 

Instead, these visionaries need to be 
empowered to take the tough decisions to 
make Indian science a meritocracy. We must 
take a census of researchers in all disciplines. 
Then, preserve scientists with research pro- 
grammes of international standing, regard- 
less of age, solely on the basis of performance 
during the past five years. Give them abun- 
dant support to ratchet up their programmes. 
Identify experienced scientists who could 
each nurture and mentor 5-10 emerging 
scientists and bring them up to an outstand- 
ing level. From such a platform, break open 
the moribund coteries that hold the system 
to ransom without themselves doing cutting- 
edge research. If this can be done, India will 
soon emerge as a scientific superpower. 

I still bubble with optimism. India 
allows young people with the right attitude 
to thrive. The nation’s history has many 
examples of the conscience of the major- 
ity successfully rejecting deeply embedded 
socio-cultural mindsets. 


JOYASHREE ROY 
Train more energy 
economists 


Professor of economics, Jadavpur 
University, Kolkata 


The energy sector will drive India’s economic 
growth for the next three decades. Better 
access to electricity and cleaner fuel sources 
will enhance the population's health and well- 
being and boost industry. But the country 
faces major challenges, from implementing 
technologies on the ground to staying within 
global carbon-emissions limits while ensur- 
ing energy access for all. 

The discussion so far is one-sided. In 
India, energy is seen mainly as a science- 
and-technology issue. There is money 
for developing microgrids and dis- 
tributed power devices. But no serious 
research is being funded to examine the 
socio-economic impacts and influences.

How will distributed generation affect 
energy prices and social dynamics? What will 
happen when new actors such as suppliers of 
low-carbon energy and ‘producer-consum- 
ers’ enter the fray? Is there an optimum path 
— environmentally, socially and economi- 
cally — for depleting natural resources? 

India needs more energy economists. 
Energy has long been seen as an unfashion- 
able topic in the country’s universities, and 
few researchers specialize in the field com- 
pared with agriculture, trade, finance and 
the environment. India must create a forum 
of energy economists who can discuss and 
compare the models used to develop energy 
strategies and influence policy dialogues 
while understanding local nuances. 

India is diverse, and political contexts 
matter. The energy sector, which meets basic 
service needs, is susceptible to partisan pol- 
itics. But scientists have become distanced 
from policy-makers. Economists need to 
fill the gap by analysing which governance 
structures and regional cooperations might 
emerge under different energy-distribution 
scenarios and technological options. 

A strategy to train the next generation of 
Indian energy economists could follow the 
model of the capacity-building programme 
for environmental economists, which ran 
from 1998 to 2003 through many participat- 
ing universities and institutes, funded by the 
World Bank in collaboration with the then 
Ministry of Environment and Forests. Similar 
efforts are now being made by SANDEE, the 
South Asian Network for Development and 
Environmental Economics. Academics from 
around the world helped to train faculties in 
environmental economics, library content 
was improved, and grants and fellowships 
were offered so that Indian researchers could
train overseas and build case studies in India.
Today, almost all universities in India have 
a well-defined, internationally comparable 
syllabus in environmental economics that is 
taught by well-trained teachers to plenty of 
students. It is now mandatory that an envi- 
ronmental economist be a member of each 
state’s environmental impact assessment 
board. A similar approach for energy eco- 
nomics, starting with interested institutes, 
would encourage more researchers to seek 
solutions to India’s energy problems. 


RAGHAVENDRA GADAGKAR 
Solve local 
problems 


Professor of ecology, Indian Institute 
of Science, Bangalore; and president, 
Indian National Science Academy 


Indian science suffers, today more than ever, 
from government apathy. This is exacerbated 
by the fact that India tries to run on the same 
track as the most developed countries and the 
best endowed institutes in the world. Only a 
handful of scientists and institutions in India 
can afford it, and then only by sequestering 
an unfair share of the country’s scant funds. 
Even these players barely compete with their 
chosen peers — never really at the top, but in 
the ‘also ran’ category at best. This leaves most 
researchers and institutions with inadequate 
resources, and worse, feeling backward. 
This is not the only model for success. If 
you cannot compete on the same track, you 
should try a different one. India should cel- 
ebrate and encourage scientists who create 
their own research questions long before
others make the topics fashionable, or those
who bring different perspectives to existing 
problems. Most importantly, we should gar- 
land those who work on problems that are 
crucial to local contexts — even if they are of 
little interest to elite overseas universities or 
to ‘high-impact’ journals. Examples include 
endemic communicable diseases, ground- 
water contamination and traditional methods 
of biodiversity conservation. 

India’s systems for peer review, grants, 
publications, jobs, awards and fellowships 
punish any potential future leaders in such 
‘unsexy’ fields. Instead, the country should 
develop new scientific ethics and etiquette. 
The research community should value, for 
instance, collaboration with small neigh- 
bouring colleges or universities instead of 
recognizing only international alliances. India 
should create a new peer-review system, a 
new ranking of journals and new measures of 
impact — all tailor-made for our needs, prob- 
lems, diseases, natural resources and educa- 
tional system. We need to believe in ourselves 
and not just chase world rankings — as indi- 
viduals, as institutions and as a country. The 
enemy is within. So is the solution. 


VINOD SINGH 
Improve tertiary 
education 


Director, Indian Institute of Science 
Education and Research, Bhopal 


India produces around 9,000 PhD graduates 
a year in science and technology. This num- 
ber sounds large, but for a population of 
about 1.3 billion it is not: the United States 
produces four times as many from a popula- 
tion one-quarter of the size. Moreover, the 
variation in quality of Indian PhD graduates 
and faculty members is a prime concern. 
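
The per-capita comparison above can be made explicit with a quick calculation. The sketch below uses only the figures quoted in the text (about 9,000 PhDs a year, a population of about 1.3 billion, and a US output four times as large from a population one-quarter the size). The US values are derived from those stated ratios rather than from independent statistics, and the function name is purely illustrative.

```python
# Figures quoted in the text; the US values are derived from the stated
# ratios (an assumption for illustration), not from independent statistics.
INDIA_PHDS_PER_YEAR = 9_000
INDIA_POPULATION = 1.3e9

US_PHDS_PER_YEAR = 4 * INDIA_PHDS_PER_YEAR   # "four times as many"
US_POPULATION = INDIA_POPULATION / 4         # "one-quarter of the size"


def phds_per_million(phds_per_year: float, population: float) -> float:
    """Return annual PhD graduates per million inhabitants."""
    return phds_per_year / (population / 1e6)


india_rate = phds_per_million(INDIA_PHDS_PER_YEAR, INDIA_POPULATION)
us_rate = phds_per_million(US_PHDS_PER_YEAR, US_POPULATION)

print(f"India: {india_rate:.1f} PhDs per million people per year")
print(f"US (implied): {us_rate:.1f} PhDs per million people per year")
print(f"Per-capita ratio (US / India): {us_rate / india_rate:.0f}x")
```

On these figures the implied gap is about 16-fold per head, which is the point the comparison is making.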

For India to be at the forefront of science 
and technology we need better govern- 
ance systems for universities, institutes and 
research labs. We need more capable academ- 
ics to provide leadership, nurture young talent 
and establish a superior research enterprise. 

Indian universities are mired in bureau- 
cracy. Archaic ordinances and rules set by 
the University Grants Commission have 
stifled the spirit of academic excellence and 
hampered institutions’ flexibility. A lack of
passionate leadership coupled with poor 
funding has blunted their edge. 

Leading the way are premier government- 
funded centres such as the 16 Indian Insti- 
tutes of Technology, the Indian Institute of 
Science in Bangalore, the Tata Institute of
Fundamental Research in Mumbai, and the
5 Indian Institutes of Science Education
and Research. These have one academic
director, who reports to a board of governors
of eminent academics, researchers and indus- 
trialists. An effective leader — with excellent 
research and administration skills — can cut 
through bureaucracy. Other public universi- 
ties should similarly be made autonomous. 

Centrally funded laboratories, tasked 
with industrially relevant research, should 
be run along similar lines and integrated 
with nearby universities and institutes. This 
would strengthen applied and interdiscipli- 
nary research. 

In 2009, the Science and Engineering Research
Board was created to make government 
science funding quicker and fairer. Its per- 
formance now needs to be benchmarked 
against overseas granting agencies such as 
the US National Science Foundation. 

Quality-control mechanisms must be 
established for the national accreditation and 
assessment of Indian PhDs and to improve 
research and educational training. Doctoral 
fellowships and research funds should be 
created in areas of national priority, including 
food security, energy and the environment. 
It is high time that India fixed its tertiary 
education system. 


UMESH VARSHNEY 
Make science an 
attractive career 


Professor and chairman, Department 
of Microbiology and Cell Biology, 
Indian Institute of Science, Bangalore 


Is there a dearth of talent in India? Certainly 
not. Is there a dearth of unstoppable achiev-
ers and innovators? Yes: because making
talent shine takes a culture that is proud
of its scientists and a charged intellectual 
environment that nurtures, mentors and 
drives them. The efforts made by a handful 
of educational institutions, academies and a
few others are crucial — but inadequate. We 
must halt the deterioration in higher-educa- 
tion standards in hundreds of universities, 
which train and produce huge numbers of 
science undergraduates and graduates. 

Science graduates are deprived of mean- 
ingful practical training because of poor 
funding, government interference, inap- 
propriately recruited faculty members 
and a lack of laboratory facilities in most 
of these centres of learning. At this crucial 
stage in their careers, students are missing 
out on the mentoring required to instil the 
culture of science and the habit of analyti- 
cal thinking and questioning. And once 
scientists are trained? They work with 
inadequate, ill-maintained equipment, and 
in isolation from stimulating peers, being 
so few in number and so geographically 
dispersed. 

It is imperative that the universities that 
produce the largest numbers of science gradu- 
ates are revived so as to be capable of contem- 
porary research. The process can be difficult 
and slow, or expensive and experimental. One 
such experiment would be to fund science 
generously. Another related one would be 
to pay researchers enough to make science a 
socially acceptable profession. 

Meanwhile, the resilient among us must 
continue by jugaad — the characteristically 
Indian technique of making do — to try to 
maintain the scientific base that exists. If 
only the management of science were left to 
scientists, India could put its research on the 
world map — just as it put the Mangalyaan 
probe into orbit around Mars.


KRISHNA N. GANESH
Connect research 
with education 


Director, Indian Institute of Science 
Education and Research, Pune 


Historically in the Indian education system, 
faculty members who teach undergradu- 
ates do not do research, and those involved 
in research (in national laboratories and 
universities) do not teach undergrads. This 
is the opposite of the conventional Western 
university system. 

To inject research-led undergraduate 
teaching, five Indian Institutes of Science 
Education and Research (IISERs) were set 
up between 2006 and 2008, in Pune, Kolkata, 
Mohali, Bhopal and Thiruvananthapuram; 
the sixth one is being established this year 
in Tirupati. At the IISERs, students are 
exposed to research early in their careers, in 
state-of-the-art labs. Customized curricula 
connect theory taught in the classroom with 
lab experiments. Courses in social sciences, 
ethics and science communication round 
out the education. 

This alliance of education and research 
catapulted the IISERs to fourth place in 
India in the 2014 Nature Index, which 
ranks institutes’ outputs — no mean feat for 
institutes less than a decade old. Together, 
the IISERs now have 350 faculty members 
and 3,500 students and will reach their final 
capacities (2,000 students and 200 faculty 
members per institute) by 2019. 

However, Indian research institutes still 
fare poorly in global rankings in terms of 
publication quality. They must try to attract 
international visiting faculty members and 
research students, and establish good ties 
with industry. More than 60% of the 600 stu- 
dents who have already graduated from the 
IISER system have secured PhD positions in 
leading universities abroad. 

This sort of brain drain is why the Indian 
system is seriously afflicted by a lack of post- 
doctoral fellows, who are the engine of the 
research enterprise elsewhere. Even the best 
professors in India depend mainly on PhD 
students for their research. The government's 
proposed fellowship plan to send Indian 
PhD holders abroad to gain experience and 
training in emerging areas should be con- 
verted to a programme that ‘twins’ Indian 
institutions with foreign research centres, 
with candidates spending half their time in 
India. Fellowships could also be opened to 
foreign nationals wishing to work in India. 
To assure career progression, these should 
dovetail into existing tenure-track systems 
— such as the INSPIRE Faculty Scheme, 
the University Grants Commission Faculty
Recharge Programme and the Ramanujan
and Ramalingaswami fellowships. 

To retain or attract back our best young sci- 
entists, and entice industry investments, India 
must create advanced research facilities and 
assured and scalable research funding, and 
must foster supportive mentors and vision- 
ary institutional leaders. To realize all this, 
the highest-achieving institutions must be 
granted immunity from general budget cuts 
and endowed with 20-30% more in fund- 
ing for the next ten years, in autonomously 
controlled budgets. Germany’s Max Planck 
Institutes provide an ideal governing model. 

This year has seen cuts in the proportion 
of gross domestic product spent on science 
and technology, from an already low starting 
point of 0.9%. This risks not only undoing 
the progress achieved, but also doing irre- 
versible damage. At the same time, many 
important scientific agencies including the 
Department of Science and Technology 
(until recently), the Council of Scientific 
and Industrial Research, the Indian Coun- 
cil of Medical Research and several national 
laboratories have been without chiefs for 
more than a year, which has stalled strategic 
decision-making and dented morale. 

In the absence of the Science Advisory 
Council to the Prime Minister, there is no 
channel for enlightening the government 
on the crucial role that scientists could have 
in addressing India’s growing challenges 
in energy, health, environment, water and 
education. The country’s science academies 
must build such a bridge. India has a vast 
supply of talented young people; it is our 
duty to nurture and harness their talent for 
a better tomorrow. 


SUNITA NARAIN 
Manage waste 
frugally 


Director-general, Centre for Science 
and Environment, New Delhi 


India has a huge waste problem. Untreated 
sewage is defiling rivers and water bod- 
ies; industrial chemicals such as cadmium 
and nitrates are seeping into the ground 
and polluting the air; and solid waste from 
kitchen scraps and plastic packaging is 
piling up in our cities. The problem requires 
more than management. We need innova- 
tive and realistic solutions that match our 
pockets and our regulatory and governance 
abilities. 

Take sewage. Flushing excreta down 
toilets is expensive and resource intensive. 
It uses water as both the carrier and the final 
dumping point. This approach works in 
countries that have the means to build huge
water-supply and retrieval infrastructures
and to pay for maintenance and upgrades of 
technologies to manage and treat pollutants 
— from biological waste to toxins. It does not 
work in India, where there are limited funds 
for supplying essential services to more than 
one billion people. A country that is poor — 
but fast becoming richer and more wasteful 
— needs a whole new paradigm. 

The key obstacle is that everyday chal- 
lenges are not top priorities for research and 
innovation. Indian science has always been 
fascinated by the ‘masculine’ agendas of 
space and genetics, not reinventing the toilet. 

Instead, science must meet the needs 
of poor people. We need to devise ways to 
prevent pollution rather than cleaning it up 
afterwards. Indian research has to be more 
humble, nimble and investigative. It has 
to learn from its poorest and most illiter- 
ate people: how they cope with scarce and 
diverse resources by being frugal and in tune 
with their environment. 

India’s ambition should be to become the 
front-runner in the race to save the planet. 


NABA K. MONDAL
Build big physics 
facilities 


Senior professor, Tata Institute of 
Fundamental Research, Mumbai 


India has an illustrious history in high- 
energy physics. But two factors make me 
worry that it will struggle to maintain its 
position: a shortage of instrument builders 
and the difficulty of getting planning per- 
mission for big facilities. 

Technological advances lie behind 
breakthroughs in particle physics. Indian 
scientists’ enthusiasm and skill for building 
particle detectors put them at the forefront 
of the field early on. In the 1950s and 1960s, 
Indian physicists pioneered experiments
with cosmic rays, and developed cloud
chambers for use at high altitude. The first 
published detection of atmospheric neutri- 
nos was made in 1965 with an instrument 
installed in a mine at Kolar Gold Fields 
(KGF) in Karnataka state. The first dedi- 
cated experiment to study proton decay was 
carried out at KGF in the early 1980s. 

Today, there is little enthusiasm among 
India’s young researchers for building instru- 
ments. One reason is that the pay-off is years 
in the making: researchers lose out in terms 
of publications compared to peers working 
in the lab or doing theoretical research. They 
find it difficult to compete in the academic 
job market. 

Unless we devise metrics that recognize 
instrument development and retain these 
skills, it will be difficult to host high-energy- 
physics experiments in India. India’s partici- 
pation in international projects will be limited 
to data analysis, making us unequal partners. 

Another obstacle is the slow and complex 
approval procedures for large experimental 
programmes in India. This is compounded 
by widespread opposition to large-scale pro- 
jects by political opportunists and activists 
on flimsy grounds. In a healthy democracy, 
meaningful debates are welcome. In India, 
they are increasingly becoming indiscrimi- 
nate and adversarial. 

For example, controversy has broken out 
over the proposed India-based Neutrino 
Observatory, an underground lab in Tamil 
Nadu for research on neutrinos and related 
particle physics. The project, of which I am
director, received government approval in 
December 2014. To stay globally competi- 
tive, it needs to be up and running by 2020. 
But we are far from breaking ground. By 
spreading fictitious fears about neutrinos, a 
small local political party and a handful of 
activists have sowed doubts in the minds of 
local people and made it extremely hard for 
us to get the required planning permissions. 

Unless scientists speak up collectively, 
it will be prohibitively difficult to develop 
major science infrastructure in India.


Kamla Devi, Rajasthan’s first female solar engineer.


Rethink India’s 
energy strategy 


Address the needs of poor and rural households, target subsidies and support 
low-carbon industries, urge Arunabha Ghosh and Karthik Ganesan. 


India’s policy-makers have three big
energy goals: providing everyone with
access to energy, securing energy supply
and trying to limit carbon emissions with-
out encumbering the nation’s growth. These
important concerns miss the point.

Energy access cannot be assured by
progress towards a simple target such as
supplying power 24 hours a day, 7 days a
week, nationwide. India has deep divides in
the quantity and quality of energy consumed
across income groups and between rural
and urban households. Fuel subsidies are
poorly designed and the strategies to reduce
them to enhance energy security are heavy-
handed. And because of limited action by the
world’s largest emitters, there is little left in
the global carbon budget before planetary
safety limits are breached. Clean energy and
alternative growth are imperative.

India’s energy priorities should be reframed
as follows: to cater to the different energy 
demands of citizens of various economic 
strata; to direct energy subsidies to benefit the 
poor; and to promote low-carbon industry. 


DISPARATE DEMAND 

Urban India aspires to have a reliable 24/7 
electricity supply — voltages currently drop 
at peak demand times such as during eve- 
nings. Meanwhile, more than one-third of 
India’s households, mostly poor and rural, 
are not connected to the electricity grid. 
For those that are, blackouts last 4-16 hours 
a day. The poorest households consume
one-quarter of the energy of those at the
highest income levels. Urban centres are in 
effect subsidised by rural areas, which are 
being overcharged for substandard service.
The poorest households pay 30% more per
unit of useful energy than the richest.

One solution to these disparate demands 
is to deliver more electricity through the 
grid while adopting cleaner energy sources. 
The Indian government has announced 
ambitious plans for renewable energy: up 
to 175 gigawatts (GW) of installed capac- 
ity by 2022. There are many challenges to 
achieving this target, from the availability of 
resource data on which to base decisions and 
managing risks to the high cost and the huge 
variability across the grid in terms of energy 
sources and infrastructure. 

Meanwhile, the promise of reliable elec-
tricity through centralized infrastructure
and systems remains unfulfilled. This is in 
part because most electricity utilities suf- 
fer financial difficulties — they lost more 
than US$19 billion in 2012-13 (ref. 3). One 
solution is to tap smaller-scale distributed 
renewable energy sources, primarily solar, 
biomass and small-scale hydropower. Off- 
grid power based on these technologies 
has advantages such as network resilience, 
flexibility and environmental and health 
benefits.

More than one million households in 
India rely on solar off-grid systems for light- 
ing. A further 20 GW of energy capacity 
could be achieved if 15% of irrigation pumps 
were converted to solar energy. Renewable- 
energy applications can provide heating, 
cooling, cooking and mechanical power as 
well as electricity.

More than 250 companies across India, 
with long supply chains and networks of 
village-level entrepreneurs, operate in the 
decentralized clean-energy sector already. 
They demonstrate that putting power in the 
hands of poor people can begin a transfor- 
mation in how energy access is understood 
and delivered. At the same time, such rapid 
growth and geographical spread could result 
in variable quality of service and expensive 
energy for poor people. More training would 
help to keep up standards. 

The challenge is to balance two types of 
investment: those in the centralized grid, 
which are key to the aspirations of millions 
of higher-income households, and funds for 
standalone systems in isolated and under- 
served areas or for integrating such systems 
to the grid. 


RATIONAL SUBSIDIES 
Another reason for pursuing renewable 
energy in India is to avoid the pitfalls of a 
growth strategy mostly based on fossil fuels. 
Already, imports account for more than 80% 
of India’s crude oil and 25% of its coal and 
gas, raising worries about supply and price 
volatility. Petroleum constitutes nearly 30%
of all commodity imports, leaving India 
little fiscal room to shrink its large current 
account deficit. 
India hands out generous energy subsi-
dies, most of which are not means-tested
(see ‘Energy imbalance’). For example, in
2013-14 the government gave away $8 billion
worth of subsidies for liquified petroleum
gas (LPG).
Yet less than half of urban households and 
only 6% of rural ones exclusively use LPG 
for cooking. Traditional biomass fuels such 
as wood account for 20% of Indian house- 
holds’ energy use. The government must 
rationalize subsidies and target them better. 


ENERGY IMBALANCE

Liquified petroleum gas (LPG) fuel is heavily
subsidized by the government, even though
it is used mainly by high-income families.
A rural electrification programme started in
2005 has improved the fairness of electricity
consumption.

1 | Share of LPG subsidy, by income bracket
(richest 20%, middle 50%, poorest 30%).
The richest 20% receives 39% of the subsidies.

2 | Energy-consumption inequality
(proportion of total consumption).
India’s richest 10% consumes 20% of electricity
and 33% of LPG.

Source: ref. 7


A well-designed programme would increase
access to modern cooking energy (electricity 
and gas) for the same budget. For instance, 
reducing subsidized LPG to 9 cylinders 
(instead of 12) per year per connection could 
save the government $724 million. Exclud- 
ing the richest 15% of households from the 
subsidy could save $1.18 billion annually. 
The savings should be redirected to increas- 
ing the availability in rural areas of cleaner 
cookstoves and biogas, and could extend LPG 
provision to 30 million more households. 


WHAT TO MAKE IN INDIA? 
Energy and climate policies are closely tied 
to industrial policy. Even on a low-carbon 
energy pathway, total primary energy con- 
sumption in India will at least double by 
2030 (compared to 2011 levels). Energy 
efficiency alone — in industry, residential 
and commercial spheres — cannot mitigate 
climate change. 

Although unemployment rates in India are
low (less than 5%), nearly 35% of employment
is casual labour. The government’s Make in
India campaign, launched in September 2014,
calls for aggressive job creation through rapid 
growth in the industrial sector. 

Manufacturing consumes nearly one- 
third of India’s primary energy supply, and 
contributes to 16% of gross domestic prod- 
uct (GDP) and more than 20% of direct 
emissions. These emissions would grow,
should India achieve its target of 25% con- 
tribution to GDP from manufacturing. 

The best opportunity for decarboniza- 
tion, therefore, is the power sector — which 
contributes nearly 38% of overall emissions.
Here, renewable energy could account for 
about 30% of the electricity mix by 2030. 

In sectors such as metal production, non- 
metallic minerals, chemicals and textiles, 
which contribute most to manufacturing 
GDP, fuel accounts for 9-23% of all input 
costs compared to the industrial-sector aver- 
age of 5%. Energy efficiency and alternative 
fuels should play a key part in decarboniz- 
ing these sectors. India’s cement industry, for 
instance, is one of the world’s most efficient, 
but it also accounts for 7% of the country’s 
emissions. Here, technological advances 
such as refuse-derived fuels could save 
600 million tonnes of coal, 550 billion units 
of electricity and 3.4 gigatonnes of carbon 
dioxide emissions between now and 2050. 

A shift to a different industrial mix is 
required: away from such energy-intensive 
sectors and towards metal fabrication, man- 
ufacture of computers and electronics, elec- 
trical and mechanical machinery, advanced 
materials, biotechnology, nanotechnology 
and photonics. This would lower the energy 
footprint of India’s industrial growth.


Arunabha Ghosh is chief executive and 
Karthik Ganesan is senior research 
associate at the Council on Energy, 
Environment and Water, New Delhi, India. 
e-mail: arunabha.ghosh@ceew.in


Oliver Sacks in New York in 1961.


AUTOBIOGRAPHY 


In search of self 
and science 


Tim Radford revels in Oliver Sacks’s memoir of his youth 
as a biker, druggie, muscle-builder — and scientist. 


A young man sets out to find himself.
He discovers motorbikes, leather,
speed and thrills, lives on a kibbutz
and has a learning experience with a prosti- 
tute in Paris. (The climax is a shared pot of 
lapsang souchong tea.) In the United States, 
as a young medic, he takes to the road on his 
motorcycle: Easy Rider in the landscape of 
John Steinbeck. He also takes to marijuana, 
LSD, methamphetamine and morning-glory 
seeds. He has the appetite for science but not
the patience for research. He is drawn to the
helpless, the hopeless and the lost, but man- 
ages to annoy the hell out of his peers, so he 
drifts from job to job. 

This is the stuff of a certain kind of mid- 
twentieth-century novel. It is also the youth 
chronicled by Oliver Sacks in what might be 
his final reminiscence, On the Move. If it is 
the last, it is the coda to an astonishing life: 
Sacks is a scientist, a doctor of medicine and a
clinical consultant who has also had a brilliant
career as a best-selling author and man-
about-neuroscience.

On the Move: A Life
Oliver Sacks
Alfred A. Knopf: 2015.

None of Sacks’s
journey into mythic America was planned.
The enthusiastic biker who first crossed the 
Atlantic from Britain in 1960 with vague 
dreams of joining the Royal Canadian Air 
Force or becoming a lumberjack instead 
achieved enduring recognition and status 
as professor of neurology at New York 
University. But you might not predict it 
from this account of uncertain beginnings 
and peripatetic adventures. 

Sacks, who had been a junior doctor at the 
Middlesex Hospital in London, became an 
intern of uncertain migrant status working 
with neurosurgeons at Mount Zion Hos- 
pital in San Francisco. He fell in love with 
California, and hung around Muscle Beach 
in Santa Monica, birthplace of the US gym 
revolution. Here as elsewhere, he did noth- 
ing by halves, consuming “five double 
cheeseburgers and half a dozen milkshakes 
per evening” to bulk up for his power lifts. 
He broke a weightlifting record (“a squat 
with a 600-pound bar on my shoulders”) 
and several bones. His capacity for alcohol 
matched his appetite for learning (at around 
17, deep in James Joyce's titanic 1922 novel 
Ulysses, he sipped his way through a litre of 
aquavit during the North Sea ferry cross- 
ing from Norway) and his ability to connect 
with others. 

That talent was to inform his understand- 
ing of patients, colleagues and readers. His 
friends have ranged from truck drivers and 
Hells Angels to polymath-director-scientist 
Jonathan Miller, geneticist Francis Crick, 
poet W. H. Auden — and Hollywood star 
and muscleman connoisseur Mae West, who 
chatted him up while he was moonlighting 
at a Los Angeles hospital. 

At the heart of this picaresque adventure 
is an unhappy secret. When Sacks admitted 
in 1951, aged 18, that he might prefer boys 
to girls, his mother called him “an abomi- 
nation” and wished that he had never been 
born. However, nothing ‘abominable’ had yet 
happened. That was to hit several years later, 
when, determined to lose his virginity, he 
headed to Amsterdam and its gay bars, but 
drank so much gin that he was unconscious 
during his deflowering. 

Sacks was not without connections and 
luck — a loving medical family in London, 
a scholarship to the University of Oxford, 
cousins including iconic US cartoonist 
Al Capp and Abba Eban, the Israeli scholar 
and diplomat. After the success of Sacks’s 
first book, Migraine (Vintage, 1970), his 
father joked that he no longer spoke
of himself as “Abba Eban’s uncle” but as
“the father of Oliver
Sacks”. And can he write? Over his life,
Sacks has filled 1,000 notebooks and 
journals, not counting journalism, 
medical notes and a lost suitcase full 
of photographs and notes — written in 
bars and restaurants, up mountains and 
in airports. He has more than a dozen 
books in print. Harold Pinter wrote a 
play inspired by his second book, Awak- 
enings (HarperPerennial, 1973); Penny 
Marshall directed the film. Awakenings 
also inspired a ballet, and Peter Brook 
directed a French theatre production 
of The Man Who Mistook His Wife For 
A Hat (Summit, 1985). Michael Nyman 
wrote an opera on the same work. 
Some of On the Move feels ripe for
a US heavyweight such as the novelist 
James Baldwin. Other parts are untidily 
told, padded with extracts from letters 
home or the young adventurer’s first 
attempts at writing. It is not quite clear 
how that youthful, ready-for-anything 
medic metamorphosed into
a distinguished professor. We
piece the story together from
anecdotes of foolhardy adventure
and episodes of clinical encounter.
Sacks writes about people with
migraines, Tourette’s syndrome or Par- 
kinson’s disease, autism, epilepsy, colour 
blindness, serious mental illness and the 
post-encephalitics of the Beth Abraham 
Hospital in the Bronx, New York, who are 
the subjects of Awakenings. These are the 
stuff of his books: not just medical cases, 
but warm, quirky and aware. 

This is another compelling front-line 
dispatch from half a century’s wonderful
exploration of brain, mind and nervous 
system. It is a valedictory memoir, and 
one with a tentatively happy ending. At 
76, this lonely writer (“it has sometimes 
seemed to me that I have lived at a cer- 
tain distance from life”) found enduring 
love. But the book’s text was handed to 
the publisher before Sacks, now 81, was 
diagnosed with cancer of the liver. He has 
just written about that in The New York 
Review of Books, and of — in the words 
of Friedrich Nietzsche — “a reawakened 
faith in a tomorrow and the day after 
tomorrow”. Here’s hoping there may yet
be an epilogue.


Tim Radford is a former science editor 
of The Guardian, and author of The 
Address Book: Our Place in the Scheme 
of Things. 

e-mail: radford.tim@gmail.com 
After the cataclysm


John Gilbey delights in a vast, technologically charged 
tale from a science-fiction supremo at the top of his game.


Neal Stephenson writes
big science fiction,
literally and figuratively.
Weighing in at
some 900 pages and 
stretching nearly 5,000 
years into humanity's 
future, Seveneves is no 
exception. 

It traces an epoch 
in which humankind 
and the environment 
change profoundly. The 
bulk of the novel is the lead- 
up to, and immediate aftermath 
of, a stunning cosmic event that 
leaves humanity teetering on the edge. The 
remainder describes a renaissance with only 
faint echoes of what we recognize as human 
culture. 

The cataclysm is the destruction of the 
Moon by a mysterious agent. As Earth 
is assaulted by a rain of debris from the 
shattered satellite, the vast majority of the 
human population faces oblivion. The core 
of the story relies on current, or currently 
anticipated, technologies — weaving a plau- 
sible tale of how a tiny number of survivors, 
the “seveneves” of the title, might secure a 
future for our species. Stephenson imagines 
the rebirth as a division into seven races, 
based on the genetic profiles of the found- 
ers. The future cultures have both old and 
new social problems, but also fresh insights 
and resources with which to address them. 

The epic injury to Earth looms in the very 
first sentence: a masterful attention-grabber. 
Stephenson maintains tension and energy, 
as well as a remarkable technical complex- 
ity, both literary and scientific. I repeatedly 
found myself sketching parts of the dramati- 
cally scaled mechanical constructs that enable 
later stages of the story — such as whip-like 
machinery to capture 
high-flying gliders 
and transfer them to 
Earth orbit — to judge 
whether they were fea- 
sible. They were. 

Seveneves
NEAL STEPHENSON
William Morrow: 2015.

Earth undergoes a catastrophic transformation in Seveneves.

Comparisons with other sci-fi epics are
inevitable. The Culture series by Iain M. Banks
carries similar social and sexual complexities,
massive terraforming machinery and off-world
habitats, and shares Stephenson’s delight in
clever characterization and off-beat humour.
Arthur C. Clarke’s The Fountains of Paradise
(1979) embodies related technical solutions, and
has a postscript that makes a temporal and
socio-biological leap of the same scale. Olaf
Stapledon’s classic Last and First Men
(1930) paints a similarly portentous picture 
of genetic manipulation, cosmic cataclysm 
and the potential future forms of humanity 
— albeit with a massively larger scope and 
extending forwards for millions of years. But 
what distinguishes Seveneves for me is Ste- 
phenson’s handling of the characters. There is 
an almost Malthusian detachment in how he 
introduces, builds, then violently dispatches 
characters who in novels with less robust rea- 
soning would be saved by a clever plot device. 

This is hard sci-fi in a real and welcome 
sense, ruled by unremitting physical laws, 
unlike the negotiable rules of the action 
thriller. People die because their deaths 
are inevitable, and many pass unremarked 
because the disaster’s scale is so vast. Their 
sacrifice is tied to the theme of engineering 
the survival of the human race. Science fic- 
tion often suffers from a disparity between 
the impressive scale of the scenery, and the 
size of the characters and how they are devel- 
oped. Stephenson balances these aspects well, 
avoiding cookie-cutter scientists and the all- 
too-common characterization of technolo- 
gists as brilliant but conflicted renegades. 

I did find myself mulling over the casting 
for the film that is sure to follow. Some- 
one needs to talk to Morgan Freeman’s 
agent, that’s all I’m saying. And an almost 
throwaway early scene is never quite 
resolved, making it clear that there is signifi- 
cant scope for sequels. I very much hope that 
Stephenson is working on them.


John Gilbey is a science and science-fiction 
writer. He teaches in the department of 
computer science at Aberystwyth University, 
UK. 

e-mail: gilbey@bcs.org.uk 
NEUROPHYSIOLOGY


The man who bared the brain 


Alison Abbott encounters the discoveries of Renaissance anatomist Andreas Vesalius. 


It is only when you read the words that
Andreas Vesalius wrote as an angry
young man in the 1540s that you get a
feeling for what drove him to document
every scrap of human anatomy his eye could
see. His anger was directed at Galen, the
second-century physician whose anatomical
teachings had been held as gospel for more
than a millennium. Roman Empire law had
barred Galen from dissecting humans, so
he had extrapolated as best he could from
animal dissections, often wrongly.

Human dissections were also banned in
most of sixteenth-century Europe, so Vesa-
lius travelled to wherever they were allowed.
He saw Galen’s errors and dared to report
them, most explicitly in his seven-volume
De Humani Corporis Fabrica (On the Fabric
of the Human Body), which he began aged 24,
working with some of the best art profession-
als of the time. His mission to learn through
direct and systematic observation marked the
start of a new way of doing science.

In Brain Renaissance, neuroscientists
Marco Catani and Stefano Sandrone pre-
sent a translation from the Latin of the
Fabrica’s last volume, which focuses on the
brain. Through it we can appreciate Vesa-
lius’s extraordinary attention to detail, and
his willingness to believe his eyes, even
when what he saw contradicted established
knowledge. We learn his anatomical vocab- 
ulary. For example, he called the rounded 
surface protuberances near the brain stem 
“buttocks” and “testes”; these are now known 
as the inferior and superior colliculi, or ‘little
hills’, which process sound and vision.

We hear Vesalius — who died at 49 in 
1564, the year Galileo was born — berating 
Galen and his sixteenth-century followers 
with unrestrained sarcasm. But we also see 
a man not quite prepared to put into plain
words the theological consequences of some 
of his discoveries, particularly his failure 
to find an anatomical explanation for the 
‘human spirit’ in the brain. 

Brain Renaissance is not the only English 
translation, but it is the only one available at a 
price that individuals might afford. Accompa- 
nying texts by Catani and Sandrone place the 
work in its historical and scientific context; 
a biography of Vesalius is rich in elements 
familiar to scientists today, such as the fear of 
plagiarism and pernicious academic rivalry. 
And a brief final section on the history of 
neuroscience warns against the temptation 
to move away from direct observation into 
overly abstract theory. 

Born into a well-to-do Brussels family of 
physicians and pharmacists, many of whom 
attended royalty, Vesalius studied medicine in 
Paris. When he was 18, his teachers allowed 
him the extraordinary privilege of assisting 
in their occasional public dissections of 
executed criminals. He continued 
his studies in Leuven, now in Bel- 
gium, where he persuaded the 
mayor to allow human dissec- 
tion. On graduation, he was 
offered a professorship in 

anatomy at the University 
of Padua, Italy — an intel- 
lectual hotbed politically 
independent of the Pope, 
where the practice of 
human dissection was 
long established. 
Padua is close 
to Venice, which 
was home to 
important 
schools of artists. 
Vesalius recruited members of Titian’s
workshop to attend his dissections and
provide the Fabrica’s exceptional illustrations.
In the first two volumes, skeletons and
flayed figures pose in romantic landscapes
full of classical iconography. Figures in
the other volumes are less ornate, but clear
and fine. The brain is often shown encased
in the skull with the top removed, revealing
cross-sections at different depths; other images
depict individual brain structures such as the
cerebellum.

One of Vesalius’s illustrations of an exposed brain from De
Humani Corporis Fabrica, 1543.

Brain Renaissance: From Vesalius to Modern Neuroscience
MARCO CATANI AND STEFANO SANDRONE
Oxford Univ. Press: 2015.

Vesalius’s thinking was as influenced 
by prevailing technologies as ours is, and 
as Galen's was, note Catani and Sandrone. 
Whereas today we draw on computer and 
social networks for metaphors about brain 
function, for Vesalius and Galen the tech- 
nology of reference was hydraulics, which 
almost miraculously kept the waterways and 
plumbing systems of their cities functioning. 
Both saw the brain in these terms, with the 
functional units being the liquid-containing 
ventricles rather than grey and white mat- 
ter. Galen held that the vivifying force of the 
pneuma physicon, or ‘animal spirit’, flowed 
down through the ventricles, then through 
hollow nerves to nourish all parts of the 
body. Vesalius ruled this out on anatomical 
grounds: he showed that there is no physical 
outlet through the skull. But he still searched 
for flow routes that would, for example, 
funnel ‘brain phlegm’ into the nostrils.
Vesalius was aware of the value of his work, 
and of the academic jealousies that could 
work against him. He chose not to use a Vene- 
tian printer who was producing a rival, Galen- 
based anatomical tome, perhaps because he 
feared that the printer would deliberately 
delay publication of his own study. Instead he 
crossed the Alps to Basel, Switzerland, where, 
still fearing intellectual theft, he stayed to 
oversee the printing. He whiled away his time 
by boiling the body of an executed murderer 
to get at his bones. The reassembled skeleton 
is still displayed in the city’s university.
IPCC: social
scientists are ready 


Social scientists are ready to 
work as full partners with 
physicists and ecologists on 
climate-change assessments by 
the Intergovernmental Panel 
on Climate Change (IPCC), 
government agencies and other 
organizations (D. Victor Nature 
520, 27-29; 2015). 

Social scientists are already 
involved in climate discussions 
to some extent through societies 
of various disciplines (see, for 
example, go.nature.com/cixl9y). 
For more than 20 years, the 
US Board on Environmental 
Change and Society and the 
former International Human 
Dimensions Programme 
on Global Environmental 
Change have been examining 
the implications of social- 
science research for global 
environmental change — and 
vice versa (see, for example, 
go.nature.com/24t7mf). 

However, social scientists 
can do more to engage with 
climate change by applying their 
disciplines and investigating 
human–environmental
interactions that have their own 
dynamics. Further investment 
will help to develop integrated 
social and environmental data 
sets to support these analyses. 
These efforts will allow social 
scientists to test and expand 
the scope of their concepts 
and methods to contribute 
important insight into such 
relatively new issues. 

Paul C. Stern National Research 
Council, Washington DC, USA. 
Thomas Dietz Michigan 

State University, East Lansing, 
Michigan, USA. 
IPCC: calling social
scientists of all kinds 


The invitation to contribute 
more to the Intergovernmental 
Panel on Climate Change 
(IPCC) should not be limited 
to social-science fields that 


are immediately relevant to 
decision-making processes, as 
David Victor seems to imply 
(see Nature 520, 27-29; 2015). 

A wider mix of social 
scientists can more effectively 
contribute valuable knowledge 
to the often-contentious 
societal and policy issues 
around climate change. 

They will help to retain and 
strengthen the role of the IPCC 
in international policy-making 
and public discourse by using 
qualitative and quantitative, 
theoretical and empirical, basic 
and applied approaches. 

This should not be difficult: 
most international social- 
science associations have 
working groups on climate and 
environmental change, and 
integrated science platforms 
such as Future Earth involve 
hundreds of social scientists 
worldwide (www.futureearth. 
org). These social scientists 
should be equal partners in the 
IPCC’s framing and scoping 
processes, along with their 
natural-scientist colleagues and 
policy-makers. 

Mathieu Denis International 
Social Science Council, Paris, 
France. 

Susanne C. Moser Future Earth, 
Santa Cruz, California, USA. 
mathieu@worldsocialscience.org 


Plant identification is 
key to conservation 


Isabel Marques’ call to update 
traditional botany teaching 
beyond plant morphology 
seems to devalue the 
importance of taxonomy and 
systematics (Nature 520, 295; 
2015). The identification of new 
plant species is still as relevant 
today as the discovery of new 
genes and gene functions — 
and is crucial for conservation 
efforts in developing countries. 
Enormous numbers of 
plant species in Brazil and 
Madagascar, for example, 
still await formal description. 
The skills needed to meet this 
challenge and their ability to 


attract funding should not 
be dismissed. The Brazilian 
government is already providing 
funding, although it will be a 
long time before the Malagasy 
government can do so. 
The botanical community 
is fast filling the gap 
between herbarium work 
and metagenomics, despite 
dwindling funding (see also 
M. Kemler Nature 521, 32; 2015). 
A good place to start reviving 
interest in a botanical education, 
and hence strengthen this 
community, would be to include 
striking plant species such as 
Rafflesia and Sarracenia in 
televised nature programmes, 
rather than focusing on 
charismatic megafauna. 
Anna Trias-Blasi, Maria 
Vorontsova Royal Botanic 
Gardens, Kew, UK. 
a.triasblasi@kew.org 


Citizen science is not 
enough on its own 


Citizen scientists’ important 
contributions to biodiversity 
conservation are constrained 
by their focus on data 
collection and public outreach 
in wealthy, accessible places. 
Sustainable conservation 
actions require initiatives 
such as those supported by 
the Participatory Monitoring 
and Management Partnership 
(www.pmmpartnership.com), 
in which data collected by land 
owners and resource users help 
to guide local decision-makers 
on conservation management. 
Citizen scientists do not 
formulate research questions, 
analyse data or implement 
management solutions on 
the basis of research findings. 
By contrast, participatory 
monitoring by local and 
indigenous communities in 
tropical, Arctic and developing 
regions enables them to propose 
solutions for environmental 
problems, advance sustainable 
economic opportunities, 
exert management rights 
and contribute to global 


environmental data sets. 

Such monitoring could 
benefit from the large-scale 
databases and knowledge 
integration pioneered by citizen 
science. Conversely, citizen 
science could benefit from the 
community-based monitoring 
practices used to build data- 
collection methods, analytical 
tools, communication networks 
and skilled workforces in 
culturally appropriate, place- 
based governance structures. 
Rod Kennett Australian Institute 
for Aboriginal and Torres Strait 
Islander Studies, Canberra, 
Australia. 

Finn Danielsen NORDECO, 
Copenhagen, Denmark. 

Kirsten M. Silvius Virginia Tech 
University, Blacksburg, Virginia, 
USA. 

rod.kennett@aiatsis.gov.au


Check the rejects 
for fame bias too 


Omid Mahian and colleagues 
suggest that scientific celebrity 
does not seem to influence the 
editors’ choice of Correspondence 
items for publication (Nature 519, 
414; 2015), but their informal 
analysis has caveats beyond those 
they mention. 

Famous authors may be more 
disposed to write letters to the 
editor than is the rest of Nature’s 
readership, for example. Any 
analysis for editorial bias should 
include data from rejected 
contributions as well (which 
were obviously unavailable to 
the authors). Then, applying a 
logistic regression or calculating 
an index of niche breadth, such 
as the proportional similarity 
index (P. Feinsinger et al. Ecology 
62, 27-32; 1981), would clinch 
the matter. 
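
As a rough illustration of the second suggestion, the proportional similarity index can be computed as one minus half the summed absolute difference between two proportion vectors. The sketch below assumes that letters are sorted into hypothetical author categories, and that counts of accepted and of submitted letters per category are available; the category scheme, the counts and the function name are all invented for illustration and are not data from Nature.

```python
def proportional_similarity(accepted_counts, submitted_counts):
    """Proportional similarity index: 1 - 0.5 * sum(|p_i - q_i|).

    p_i is the share of accepted letters in category i and q_i the share
    of submitted letters in that category. A value of 1 means accepted
    letters mirror submissions exactly; values near 0 mean editors draw
    from a narrow subset of what is submitted.
    """
    total_accepted = sum(accepted_counts)
    total_submitted = sum(submitted_counts)
    p = [count / total_accepted for count in accepted_counts]
    q = [count / total_submitted for count in submitted_counts]
    return 1.0 - 0.5 * sum(abs(pi - qi) for pi, qi in zip(p, q))


# Hypothetical counts for two author categories (highly cited, others).
accepted = [8, 2]
submitted = [50, 50]
print(proportional_similarity(accepted, submitted))
```

An index well below 1 would hint that acceptances do not track submissions, although only a significance test on real data could settle whether the gap reflects bias.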

Perhaps a simpler 
demonstration of a lack of
editorial bias might be the 
publication of this note, signed 
by a student who has no 
publishing record at all. 

Davide Nespoli University of 
Milan, Italy. 
davide.nespoli@unimi.it 
The slow death of red galaxies


For most galaxies, the shutdown of star formation was a slow process that took 4 billion years. An analysis of
some 27,000 galaxies suggests that ‘strangulation’ by their environment was the most likely cause.


ANDREA CATTANEO 


In humans, death by strangulation is a slow
process that takes about four minutes.
During this time, the victim uses up oxy-
gen in the lungs but keeps producing carbon
dioxide, which remains trapped in the body.
On page 192 of this issue, Peng et al. present
evidence of an analogous slow ‘strangula-
tion’ process that ends the formation of stars
in many galaxies by disrupting the supply of
gas that accretes onto those galaxies from the
environment. Instead of building up CO₂,
the strangled galaxies accumulate metals
(elements heavier than helium) produced
by massive stars.

Almost 90 years ago, Edwin Hubble
classified galaxies into three morphological
types: spirals, ellipticals and lenticulars (Fig. 1).
In spirals, stars form a disk and turn around
in circles like the horses on a cosmic merry-
go-round. Ellipticals are the wrecks of galaxy
crashes, in which stars move chaotically in all
directions. Lenticulars form an intermediate
type between the two. Most spirals are blue
because they contain young blue stars, but
elliptical and lenticular galaxies contain little
or no cold gas to make new stars, and so only
old red stars remain. Historically, astronomers
have been more interested in the morpholo-
gies of galaxies than in their colour. Atten-
tion switched to colour after the Sloan Digital
Sky Survey (SDSS) measured the spectra of
hundreds of thousands of galaxies.

The SDSS demonstrated that blue (star-
forming) and red (passive) galaxies form
distinct populations. Since then, various
hypotheses have been put forward to explain
what causes galaxies to transition from one
type to the other. Most revolve around two
ideas. The first is that the gas in ellipticals and
their surroundings is too hot to make stars and
does not cool efficiently. The second is that the
gas that could cool and make stars is kept hot
or blown away by phenomena linked to the
growth of supermassive black holes, which
are found at the centres of all ellipticals. The
main motivation for considering violent
expulsion scenarios comes from computer sim-
ulations of the formation of ellipticals in galaxy
mergers. These simulations need a mecha-
nism that gets rid of gas to avoid forming blue
cores, which are rare in real ellipticals.

Figure 1 | Colours and masses of galaxies. Blue galaxies produce new stars, whereas red galaxies do not.
Spiral galaxies generally have relatively low masses (usually lower than 10^10.5 times the mass of the Sun).
Massive lenticular or elliptical red galaxies have masses approximately 10^10.5–10^11.5 times the mass of the
Sun. Any process that quenches star formation will shift galaxies from the blue to the red population.
Peng et al. present evidence for a slow process, called strangulation, that shuts down star formation and
modifies the chemical composition of galaxies, but preserves their mass and structure.

Peng et al. present evidence that the forma- 
tion of stars in most passive galaxies ended 
through a slow strangulation process. The 
authors compared the metal content of the 
stars in approximately 23,000 passive galaxies 
from the SDSS with that of a control sample of
about 4,000 star-forming galaxies, also from
the SDSS. They discovered that the metal con-
tent of the former is systematically larger than
that of the latter, at least for galaxies that have
stellar masses up to 100 billion times the mass
of the Sun, the limit mass (M*) above which
galaxies become scarce. This constitutes evi-
dence for galactic ‘suffocation’, in the same way
that high levels of CO₂ in the blood of a corpse
suggest suffocation.

From the difference in the metal content of
the stars of passive and star-forming galaxies,
Peng et al. inferred a delay of 4 billion years (or
2 billion years for galaxies close to M*) between
the time that gas stopped being supplied and 
the time that star formation ended. This delay 
is consistent with the mean age difference 
between passive and star-forming galaxies 
(about 4 billion years at all masses). 
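
The logic of turning a metal excess into a delay can be sketched with a toy ‘closed-box’ model. This is an assumption made here for illustration, not the method used by Peng et al. If no fresh gas arrives, and stars keep forming from the remaining gas on a fixed depletion timescale, the metal fraction of that gas rises roughly linearly with time, so a measured excess maps onto an elapsed time. The function names and all parameter values below are illustrative assumptions.

```python
def gas_metallicity(t_gyr, z0, stellar_yield, return_fraction, depletion_time_gyr):
    """Metal fraction of the gas t_gyr after the gas supply is cut off.

    Toy closed-box model with instantaneous recycling:
    dZ/dt = stellar_yield * (1 - return_fraction) / depletion_time_gyr.
    """
    rate = stellar_yield * (1.0 - return_fraction) / depletion_time_gyr
    return z0 + rate * t_gyr


def delay_from_enrichment(delta_z, stellar_yield, return_fraction, depletion_time_gyr):
    """Invert the model: time needed to build up a metal excess delta_z."""
    rate = stellar_yield * (1.0 - return_fraction) / depletion_time_gyr
    return delta_z / rate


# Illustrative parameters only (not values from the SDSS analysis).
z_start = 0.010
z_end = gas_metallicity(4.0, z_start, stellar_yield=0.02,
                        return_fraction=0.4, depletion_time_gyr=2.0)
print(delay_from_enrichment(z_end - z_start, 0.02, 0.4, 2.0))
```

In this picture a larger metal excess implies a longer interval between the gas being cut off and star formation ending, which is the sense in which the chemistry acts as a clock.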

As any forensic scientist will tell you, suf- 
focation does not imply strangulation. But 
the difference in metal content is higher for 
galaxies in groups and clusters than it is for 
isolated galaxies, suggesting that crowded 
environments strangle galaxies by disrupting 
the accretion of gas onto them. This disruption 
might occur either through ram pressure (the 
pressure exerted on a body moving through a 
fluid medium) or through tidal forces. 




The slow shutdown of star formation 
inferred by Peng et al. from observations of 
galaxies with masses lower than or equal to M*
contrasts with the fast shutdown behaviour of
much larger galaxies, such as giant ellipticals.
The stars of giant ellipticals have a low iron
content because they were made over a short
time span (less than 0.3 billion years for a
galaxy of mass greater than 3 M*); that is,
there was not enough time for many type Ia
supernova explosions, the source of iron.
Because there are many more red galaxies
below M* than there are above, Peng et al. are
correct to argue that strangulation is the main
mechanism for star-formation shutdown. But
the different chemical properties of low- and
high-mass red galaxies imply that they must
have formed through different routes. Morph-
ological differences back this interpretation:
red galaxies with masses above M*/3 are all
ellipticals or lenticulars; but below M*/3, 40%
of them are red spirals, as would be expected
if strangulation has occurred. 

There is also a difference between ellipticals 
and lenticulars. Most elliptical galaxies that have 


stellar masses in the range of 1 M* to 2 M* were
already in place at a redshift of between 2 and 3,
a period when the Universe was about one-fifth
of its present age. Lenticulars appeared more
gradually, replacing a pre-existing population
of spiral and irregular galaxies (M. Huertas-
Company, personal communication). The ratio
of ellipticals to lenticulars is approximately 3:5
for galaxies of approximately 1.5 M*, but lenti-
culars should predominate in the mass range
explored by Peng and colleagues. Further evi-
dence for the presence of two evolutionary
tracks has been obtained by comparing the
evolution of the star-formation rate of galaxies
with the evolution of the galaxies’ size.
Cosmological models of the formation and
evolution of galaxies predict two mechanisms
by which star formation shuts down. In these
models, galaxies more massive than 1.5 M*
reside at the centres of groups and clusters. 
They grow extremely rapidly until, at a redshift 
of about 3 (which corresponds to when the 
Universe was one-quarter of its current size), 
they attain the critical mass at which infalling 
gas is effectively shock-heated. Some violent
phenomenon then quenches star formation.
However, the models also predict that galaxies
below 0.6 M* become red much later (at a red-
shift of less than 0.5) and much more gradu-
ally, in most cases because they stop accreting
gas after becoming part of a group or a clus-
ter. Thanks to Peng and colleagues’ work, this
second theoretical prediction is now an obser-
vational fact.


Andrea Cattaneo is in the Observatoire de
Paris, GEPI, 75014 Paris, France.
e-mail: andrea.cattaneo@obspm.fr


Internal compass puts
flies in their place


An analysis reveals that fruit-fly neurons orient flies relative to cues in the insects’
environment, providing evidence that the fly’s brain contains a key component
for drawing a cognitive map of the insect’s surroundings. SEE ARTICLE P.186


THOMAS R. CLANDININ & LISA M. GIOCOMO


Animals need accurate navigational
skills as they go about their every-
day lives. Many species, from ants to
rodents, navigate on the basis of visual land-
marks, and this is complemented by path
integration, in which neuronal cues about the
animal’s own motion are used to track its loca-
tion relative to a starting point. In mammals,
these different types of navigation are inte-
grated by neurons called head-direction cells.
In this issue, Seelig and Jayaraman (page 186)
provide the first evidence that certain neurons
in fruit flies have similar properties to head-
direction cells, encoding information that
orients the insects relative to local landmarks.

Head-direction cells act as a neuronal 
compass that generates a cognitive map of 
an animal’s environment. The activity 
of each head-direction cell increases as the 
animal faces a particular direction, with 
different cells preferentially responding to dif-
ferent directions. Rather than certain cells
always responding to north, south and so on, 
the direction in which the cells fire is set up arbi- 
trarily when the animal encounters new visual 
landmarks. The signals are then updated by self- 
motion cues as the animal navigates. Studying 
head-direction cells in mammals is challenging 
because of the complexity of the mammalian 
brain. By contrast, the small fly brain is a good 
model for studying neuronal activity. 

Figure 1 | A shared view of navigation. Mammals navigate on the basis of a combination of visual (blue dotted arrow) and self-motion cues. These cues are
integrated by head-direction neurons to generate an internal compass (depicted in red) that helps the animal to orient itself in its surroundings. Seelig and Jayaraman
report that the neurons of fruit flies also process visual and self-motion cues to enable navigation, suggesting that flies, too, may have a cognitive compass.

Seelig and Jayaraman recorded the activity
of a specific population of neurons in the
fruit-fly central complex, a region that spans
the midline of the brain and which coordi-
nates movement with visual cues. The authors
placed flies on a ball in a virtual-reality arena
of visual displays, and fixed the insects’ heads 
in place. The flies explored the arena by walk- 
ing on the ball, and the authors monitored 
turning behaviour (calculated from the ball’s 
movements) and neuronal activity. Neurons 
in the central complex showed highly tuned 
responses that encoded the fly’s orientation 
relative to a visual cue from the arena. The 
researchers showed that such orientation 
responses are generated by a population of 
highly active neurons, which together form 
a single ‘bump’ of neuronal activity that 
marks the direction in which the fly is facing 
relative to the visual cue. This activity bump 
seamlessly rotates around a ring of neurons 
in concert with the fly’s location relative to 
the cue, with changes in the cue’s location 
eliciting a concordant shift in the position of the 
activity bump. 
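
As a rough illustration of how a single bump on a ring of neurons can encode heading, the sketch below builds a bump of activity centred on the fly’s heading and then reads the heading back from the population as a whole. The number of cells, the bell-shaped (von Mises) tuning curve and the function names are assumptions made for illustration; they are not taken from Seelig and Jayaraman’s recordings.

```python
import math


def bump_activity(heading, n_cells=16, sharpness=4.0):
    """Activity of n_cells ring neurons for a heading given in radians.

    Each cell has an evenly spaced preferred direction and fires most
    strongly when the heading matches it (hypothetical tuning curve).
    """
    preferred = [2.0 * math.pi * i / n_cells for i in range(n_cells)]
    return [math.exp(sharpness * (math.cos(heading - p) - 1.0)) for p in preferred]


def decode_heading(activity):
    """Recover the heading from the bump as an activity-weighted circular mean."""
    n_cells = len(activity)
    x = sum(a * math.cos(2.0 * math.pi * i / n_cells) for i, a in enumerate(activity))
    y = sum(a * math.sin(2.0 * math.pi * i / n_cells) for i, a in enumerate(activity))
    return math.atan2(y, x) % (2.0 * math.pi)


# Rotating the heading shifts the bump around the ring by the same angle.
for heading in (0.5, 1.0, 3.0):
    activity = bump_activity(heading)
    print(round(decode_heading(activity), 3), activity.index(max(activity)))
```

Reading the heading from the whole population rather than from the single most active cell gives a continuous estimate even with a small number of cells.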

Do the orientation responses encoded by fly 
neurons reflect the convergence of self-motion 
and landmark cues, as in head-direction cells? 
In support of this theory, increasing the com- 
plexity of the visual cue did not affect the ori- 
entation response. This indicates that neuronal 
activity predominantly reflects the fly’s loca- 
tion relative to the visual landmark, rather than 
being a specific feature of the cue itself. 

Under normal circumstances, animals 
integrate both self-motion and landmark cues. 
However, if one type of cue is missing, the ani- 
mal must rely on the other. Seelig and Jayara- 
man allowed flies to explore the virtual-reality 
arena in the dark, thus eliminating landmark 
cues. Orientation responses remained stable, 
indicating that the response can be driven by 
self-motion, but they drifted slowly over time, 
showing that the accuracy of the response 
depends on both types of cue. Moreover, the 
fact that flies preserve a neuronal memory of 
orientation even in the absence of the visual 
cue that created it demonstrates that their navi- 
gational behaviour is much more than a simple 
sensorimotor reflex. 

Thus, the current study points to many 
parallels between the orientation responses in 
flies and mammalian head-direction signals.
Both show increased activity when the animal
faces a particular direction, irrespective of
its location at the time. Both are maintained
by internal self-motion cues, with familiar
visual cues establishing each cell’s direction-
ality. And, as in flies, orientation in rodents
remains coherent in the dark, but drifts slowly
over time. These similarities demonstrate
the sophistication of the insect navigational
system. The mammalian system is thought
to help generate an internal representation of
space by acting as a neuronal compass (Fig. 1).
The possibility that the fly brain contains at 
least one of the components needed to create 
a similar cognitive map is intriguing. 

This study also improves our under- 
standing of the mechanisms underlying 
orientation responses in both invertebrates 
and vertebrates. Head-direction cells are
hypothesized to arise from networks of
neurons, dubbed ‘attractor networks’. These
networks are thought to adopt a ring-like
architecture that allows a single activity bump
to move around the network in concert with
an animal’s movement. Although some evi-
dence consistent with this hypothesis has been
observed in the past year in small numbers of
rodent neurons, large-scale dynamics remain
difficult to observe in mammalian neuronal
networks. Seelig and Jayaraman observe classic
traits of a ring-like attractor network in flies,
including coherent activity in the absence of
visual inputs, and slow error accumulation,
some of the most direct evidence to date
for attractor networks as the mechanism that 
generates orientation responses. The possibil- 
ity that ring-like attractor networks are evolu- 
tionarily conserved raises the exciting prospect 
that similar internal computational principles 
are used to calculate orientation in disparate 
species. 
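
The ring picture can be made concrete with a toy model. The sketch below is not the authors' analysis: it assumes a hypothetical ring of 32 units carrying a single cosine-tuned activity bump, a population-vector readout, and an illustrative noise level. It shows how a bump that is moved only by noisy self-motion signals keeps a coherent heading estimate that can slowly drift away from the true heading.

```python
import math
import random

N_UNITS = 32       # hypothetical number of units on the ring
SHARPNESS = 4.0    # hypothetical parameter setting the bump width

def bump(heading):
    """Activity of each unit for a bump centred on `heading` (radians)."""
    return [math.exp(SHARPNESS * math.cos(2 * math.pi * i / N_UNITS - heading))
            for i in range(N_UNITS)]

def decode(activity):
    """Population-vector readout of the bump position, in [0, 2*pi)."""
    x = sum(a * math.cos(2 * math.pi * i / N_UNITS) for i, a in enumerate(activity))
    y = sum(a * math.sin(2 * math.pi * i / N_UNITS) for i, a in enumerate(activity))
    return math.atan2(y, x) % (2 * math.pi)

def integrate_in_dark(turns, noise_sd, seed=0):
    """Move the bump using noisy self-motion only; return (true, estimated) heading."""
    rng = random.Random(seed)
    true_heading = estimate = 0.0
    for turn in turns:
        true_heading = (true_heading + turn) % (2 * math.pi)
        noisy_turn = turn + rng.gauss(0.0, noise_sd)
        # The bump is shifted by the sensed turn, then read out again.
        estimate = decode(bump(estimate + noisy_turn))
    return true_heading, estimate

true_heading, estimate = integrate_in_dark([0.1] * 50, noise_sd=0.02)
print(f"true heading {true_heading:.3f} rad, estimated {estimate:.3f} rad")
```

With `noise_sd=0.0` the estimate tracks the true heading; any non-zero noise lets the error build up step by step, which is the qualitative behaviour described above.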

However, the role of self-motion compared 
to translational movement — movement of an 
animal’s body position through the environ- 
ment relative to the cue — in driving orienta- 
tion responses remains a mystery. In rodents, 
head-direction responses remain coherent 
despite translational movement. For example, 
if a head-direction cell becomes established as
north-preferring in response to a visual cue to 
the northeast, it will remain tuned to the north 
even if the animal moves such that the visual 
cue is now to its southeast. The flies in Seelig 
and Jayaraman’s preparation, however, do not 
experience this type of translational move- 
ment, because the fly’s body position relative 
to the cue remains fixed owing to the design 
of the virtual-reality system. It will be interest- 
ing to determine how fly orientation neurons 
respond to translation. 


The authors’ work provides insight into the 
neuronal basis of navigation. It was already 
known that insects use path integration, use
polarized light to orient relative to the Sun
and show behaviours indicative of spatial
memory. But Seelig and Jayaraman provide
the first evidence for one of the components 
required to construct a cognitive map — a 
computation previously thought the preserve 
only of vertebrates. This finding paves the 
way for dissecting the neuronal basis of navi- 
gation by leveraging the powerful genetic 
tools and tractable neuronal circuits available 
in flies.


Thomas R. Clandinin and Lisa M. Giocomo 
are in the Department of Neurobiology, 
Stanford University, Stanford, California 
94305, USA. 
giocomo@gmail.com
Quantum control
of free electrons 


Optical pulses have previously been used to place the electrons in the beam of an 
electron microscope into well-defined energy states. These electrons can now be 
put in a quantum superposition of those states. SEE LETTER P.200 


MATHIEU KOCIAK 


Quantum mechanics is a daily affair for
electron microscopists. The research- 
ers routinely focus their microscopes’ 
electron beams on samples to create quantum 
interference patterns that reveal information 
about the samples’ atomic and molecular 
structure. In an exciting paper in this issue,
Feist et al. (page 200) demonstrate how they
have used light focused on a nanostructure to 
make the fast electrons of one such electron 
beam exhibit another type of quantum behav- 
iour — Rabi oscillations. 

Quantum systems have properties that 
their classical counterparts lack. For two-state 
Figure 1 | Quantum superpositions. a, In a two-level system, an electron can be in either of two
stationary states (0 or 1). An applied strong, oscillating electromagnetic field (black line) can put the
electron into a coherent superposition of states. The probabilities of detecting the electron in either of the
states oscillate with increasing strength of the field (Rabi oscillations). b, The freely propagating electrons
in the electron beam of an electron microscope, such as that studied by Feist et al., can be placed in a large
set of stationary states using an appropriately shaped electromagnetic field: state 0, which is determined
by the nominal accelerating voltage of the microscope; states -1 to -N, which have the energy of state 0
minus integer multiples of the electromagnetic field’s energy; and states +1 to +N, which have the energy of
state 0 plus integer multiples of the field’s energy. For a strong field, each electron is put in a superposition
of these states, and the probability that the electron is in any of these states oscillates with the field strength
(unconventional, multilevel Rabi oscillations). For simplicity, only a subset of the oscillations is shown.
(Plot in b adapted from ref. 1.)


quantum systems, Rabi oscillations between 
the two, otherwise stationary, states of the 
system are one such property. To understand 
these oscillations, consider an electron in an 
atom that has two energy states, a ground 
state and an excited state (Fig. 1a). The elec- 
tron will stay in one of the states forever if no 
external perturbation is applied to it. But if, 
for example, an electromagnetic field is applied 
to the system, the probability of finding the 
electron in either of the states will oscillate 
with time and the strength of the electromag- 
netic field: these are the Rabi oscillations. 
The oscillation frequency, called the Rabi 
frequency, depends mainly on the strength 
of the perturbation. Stopping the oscillations 
at any time leaves the electron in a coherent 
quantum superposition; that is, in a combination
of both stationary states at the same time.
Rabi oscillations are therefore commonly used 
to prepare quantum systems in a superposi- 
tion of states. Such superposition provides a 
means of encoding information in quantum- 
information technologies. 
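
For readers who want numbers, the two-state case can be sketched in a few lines. The example assumes the textbook idealization (resonant driving, no decay, electron initially in the ground state), in which the excited-state probability is sin²(Ωt/2) for Rabi frequency Ω. The value of Ω used here is hypothetical and chosen only for illustration.

```python
import math

def excited_probability(rabi_frequency, time):
    """Probability of finding the electron in the excited state.

    Assumes resonant driving, no decay, and a start in the ground state.
    """
    return math.sin(0.5 * rabi_frequency * time) ** 2

def superposition_amplitudes(rabi_frequency, time):
    """Magnitudes of the (ground, excited) amplitudes if the drive stops at `time`."""
    half_angle = 0.5 * rabi_frequency * time
    return abs(math.cos(half_angle)), abs(math.sin(half_angle))

omega = 2 * math.pi * 1.0e6     # hypothetical Rabi frequency, in rad/s
half_period = math.pi / omega   # time for a complete ground-to-excited transfer
for fraction in (0.0, 0.5, 1.0):
    t = fraction * half_period
    print(f"t = {t:.2e} s  P(excited) = {excited_probability(omega, t):.3f}")
```

Stopping the drive halfway to full transfer leaves equal probabilities for the two states, which is one simple way of preparing a superposition.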

In the electron microscope used by Feist 
et al., the freely propagating electrons of the 
electron beam can be placed in a large set of 
stationary states (Fig. 1b). If Rabi oscillations
between these states are to be observed, the 
probability of the electrons being in a given 
energy state must be measured. This probability
measurement can be easily done using
an electron energy-loss spectrometer (EELS), a
device that can be added to an electron micro- 
scope. It is usually used to produce a spectrum 
of the electron intensity that is transmitted 
through the sample under study as a function 
of the energy loss caused by scattering from the 
sample. The EELS spectrum of freely propa- 
gating electrons consists of peaks centred at 
the energy of the electrons, and measuring 
the height of the peaks gives the probability of 
finding them in these energy states. 

However, putting freely propagating elec- 
trons in a coherent quantum superposition 
is much more difficult than measuring their 
probabilities of being in a given state. Unlike 
an electron bound to an atom, a fast electron 
moving in free space will generally not couple 
to an electromagnetic field such as an optical 
plane wave. But it is well established that such 
an electron can easily couple to an evanescent 
wave — a wave that does not propagate but that 
decays exponentially with distance from the 
boundaries of objects, such as nanostructures, 
at which they are formed. Electron microscopy 
is regularly used for imaging optical excitations 
in nanostructures at high spatial resolution 
through the nanostructures’ evanescent fields.

By using synchronized femtosecond 
(1 femtosecond is 10⁻¹⁵ seconds) pulses of
electrons and photons in an electron-micro- 
scope set-up, researchers have previously 
coupled photons to fast electrons through 
the evanescent light field of a nanostructure
(an individual carbon nanotube or a silver
nanowire). They showed that, in such cases,
electrons could absorb or emit photons many 
times, resulting in spectra of electron-energy 
loss (or gain) consisting of a series of peaks 
evenly spaced according to the energy of the 
photons. The peaks’ intensity, and thus the 
probability of finding an electron in a given 
state, decreased monotonically with the energy 
of the electron loss (or gain). Soon after this 
remarkable experiment, others predicted that,
using a similar experimental set-up, quantum 
superpositions of freely propagating electron 
states should be observable for longer-lasting 
photon pulses as a series of peaks with oscillat- 
ing energies of electron loss (or gain). 

And this is exactly what Feist et al. have 
demonstrated experimentally in their study. 
The authors observed a quantum super- 
position of freely propagating electron states 
by focusing femtosecond electron pulses on a
sharp, nanometre-sized gold tip illuminated 
by picosecond laser pulses (1 picosecond is 
10⁻¹² s). The tip turned the laser’s plane wave
into an evanescent one, allowing strong 
coupling of the large number of freely propa- 
gating electron states. Each state’s probabil- 
ity, encoded in the intensity of each peak in 
an EELS spectrum, oscillates at its own pace, 
leading to unconventional, multilevel Rabi 
oscillations (Fig. 1b). 
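
The multilevel oscillations can be illustrated numerically. The sketch below rests on an assumption that is not stated in this article: in the standard theoretical description of this kind of experiment, the probability of an electron having gained or lost N photons is J_N(2|g|)², where J_N is a Bessel function of the first kind and g is a dimensionless coupling strength proportional to the field amplitude. The Bessel function is evaluated from Bessel's integral so that the example needs only the standard library.

```python
import math

def bessel_j(order, x, steps=2000):
    """Bessel function J_order(x) from Bessel's integral, using the midpoint rule."""
    h = math.pi / steps
    total = sum(math.cos(order * (k + 0.5) * h - x * math.sin((k + 0.5) * h))
                for k in range(steps))
    return total * h / math.pi

def sideband_populations(coupling, max_order):
    """Map each sideband index N to its assumed population J_N(2*coupling)**2."""
    return {n: bessel_j(n, 2.0 * coupling) ** 2
            for n in range(-max_order, max_order + 1)}

# Hypothetical coupling strengths; the central peak (N = 0) rises and falls
# as the field grows, while population moves into the sidebands.
for g in (0.0, 0.5, 1.0, 2.0):
    pops = sideband_populations(g, max_order=20)
    print(f"g = {g:.1f}  P(0) = {pops[0]:.3f}  P(+1) = {pops[1]:.3f}  P(+2) = {pops[2]:.3f}")
```

Each sideband follows its own Bessel function of the field strength, which is why, in this model, every peak in the spectrum oscillates at its own pace.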

Feist and colleagues’ experimental achieve- 
ment lies largely in the development of an elec- 
tron gun that generates electron pulses with high 
brightness. Such an electron gun allows a rela- 
tively narrow electron beam (one with a diam- 
eter of about 15 nanometres) to be formed. A 
wider beam would average the evanescent field 
in such a manner that the Rabi oscillations and 
the coherent quantum superposition would be 
lost. By experimentally introducing the field of 
free-electron quantum optics, this work projects 
free electrons into the world of quantum infor- 
mation — although it will be technologically 
demanding to transform an electron micro- 
scope into a quantum-information processor. 
However, as Feist et al. also demonstrated, the 
control of the beam’s freely propagating elec- 
trons leads to effects other than quantum super- 
position, including the formation of a train of 
attosecond (one billion-billionth of a second) 
electron pulses. Such pulses could find appli- 
cations in ultrafast electron spectroscopy and 
microscopy. 
Fungus against the wall


A compound derived from plant cell-wall material that is a waste product of biofuel
manufacture has been found to have fungicidal properties: it interacts with a
carbohydrate called β1,3 glucan, thus compromising the integrity of fungal cells.


PAUL O’MAILLE 


agriculture, and as the world’s population 
grows, a rising tide of resistance to anti- 
fungal agents is threatening global food secu- 
rity. This threat has led to an intensification in 
the use of fungicidal agrochemicals, which may 
themselves put human health at risk and dam- 
age the environment. New fungicidal agents 
are therefore desperately needed; and now it 
seems that help is on the way, but from unlikely 
sources. Writing in Proceedings of the National 
Academy of Sciences, Piotrowski and colleagues
report the discovery of molecules in the waste 
products of biofuel manufacture that possess 
antifungal activity against plant pathogens. 
Dwindling oil reserves have spurred 
the development of technologies that use 
lignocellulosics — dry plant matter such as 
grass — as a renewable source of sugar to 
produce biofuels and bio-based chemicals. 
A key step in the process is the hydrolysis of
polysaccharides in the plant cell wall, which 
liberates sugar monomers so they can be fermented
by yeast (Fig. 1a). But other molecules
are also liberated by hydrolysis, including 
diferulates, metabolites of the phenylpropa- 
noid family of natural products. These mol- 
ecules crosslink polysaccharides to each other, 
and to lignin polymers, to stiffen the plant cell 
wall, but the waste products stunt the growth of 
the yeast, inhibiting the fermentation process. 
Piotrowski and colleagues saw this problem as 
an opportunity for ‘bioprospecting’, reasoning
that these molecules might be an untapped 
resource of antifungal agents. 

The authors screened diferulates for 
antifungal activity by testing the molecules’ 
ability to inhibit growth of a fungus, the bud- 
ding yeast Saccharomyces cerevisiae. They 
identified two active compounds that inhibited 
growth, the most potent of which they dubbed 
poacic acid because of its prevalence in grasses 
Figure 1 | Waste of a fungicide. a, Grasses of the Poaceae family have cell walls fortified with
polysaccharides that are crosslinked with metabolites called diferulates, which act as an anchor for the
formation of lignin polymers (not shown). Bioprocessing of such grasses liberates the polysaccharides
for hydrolysis to sugar monomers, from which yeast can produce biofuels and bio-based chemicals. The
process also liberates diferulates such as poacic acid as waste products. b, Piotrowski et al. report that
poacic acid has antifungal activity against yeast cells. The compound attacks the carbohydrate β1,3 glucan,
which acts as a fortification in the yeast cell wall. As a result, cells become leaky and adopt abnormal shapes.
of the Poaceae plant family. Importantly,
poacic acid had comparable antifungal activ- 
ity to two widely used agricultural fungicides. 

Next, Piotrowski et al. investigated how 
poacic acid exerts its effect, by challenging a 
pooled mixture of approximately 4,000 yeast 
strains with the compound. Each strain carried 
a different gene deletion that was marked by a 
unique molecular barcode — a short sequence 
of DNA that does not affect the cell’s function 
but can be used to identify the strain. After 
exposing the cells to poacic acid, the authors 
took a head count with next-generation DNA 
sequencing, measuring the frequency of each 
barcode in the population to determine the 
relative sensitivity or resistance of each strain 
to the fungicide. The result was clear: deletion
of genes involved in cell-wall organization
conferred poacic-acid sensitivity, suggesting 
that the compound affects the yeast cell wall, 
thus mirroring the mechanism of many known 
antifungal compounds.
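
The barcode head count amounts to comparing barcode frequencies with and without the compound. The sketch below is only an illustration of that idea, not the authors' pipeline: the barcode names, read counts and the small pseudocount are all made up, and real analyses add normalization and statistics that are omitted here.

```python
import math
from collections import Counter

def barcode_frequencies(reads):
    """Fraction of sequencing reads carrying each barcode."""
    counts = Counter(reads)
    total = sum(counts.values())
    return {barcode: n / total for barcode, n in counts.items()}

def sensitivity_scores(control_reads, treated_reads, pseudocount=1e-6):
    """log2(treated / control) frequency per barcode; negative means sensitive."""
    control = barcode_frequencies(control_reads)
    treated = barcode_frequencies(treated_reads)
    return {barcode: math.log2((treated.get(barcode, 0.0) + pseudocount)
                               / (freq + pseudocount))
            for barcode, freq in control.items()}

# Made-up reads for three hypothetical deletion strains.
control = ["BC_wall"] * 100 + ["BC_neutral"] * 100 + ["BC_other"] * 100
treated = ["BC_wall"] * 10 + ["BC_neutral"] * 95 + ["BC_other"] * 95
scores = sensitivity_scores(control, treated)
most_sensitive = min(scores, key=scores.get)
print(most_sensitive, round(scores[most_sensitive], 2))
```

A strain whose barcode becomes rarer after treatment gets a negative score, flagging the deleted gene as one that normally helps the cell to withstand the compound.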

The authors then tested poacic acid in com- 
bination with two antifungal compounds that 
compromise cellular integrity and that are
normally used to treat human fungal infec- 
tions: fluconazole, which targets the biosyn- 
thesis of ergosterol, an essential component of 
the fungal cell membrane; and caspofungin,
which inhibits the biosynthesis of β1,3 glucan,
a polysaccharide that acts as both a barrier and 
structural support in the fungal cell wall. Per- 
haps not unexpectedly, poacic acid acted in 
synergy with these compounds, indicating that 
each drug strikes the same or related pathways, 
albeit through different targets.

To further investigate how poacic acid 
alters the cell wall, Piotrowski et al. used high- 
dimensional microscopy to study yeast cells 
treated with poacic acid. The neck between 
the yeast cell and the daughter that buds off 
it was unusually wide in treated cells, and the 
degree of morphological variability in the cell 
population was higher than normal — similar 
changes to those caused by other drugs that 
disrupt the cell wall (Fig. 1b). Poacic acid also
caused rapid influx of propidium iodide dye 
into the cell, consistent with cell-wall disruption.
In a stroke of resourcefulness, the authors
used the natural fluorescence of poacic acid 
to study its location within the cell. The cell’s 
surface lit up under the microscope, suggest- 
ing that poacic acid interacts directly with the 
β1,3 glucan layer. The researchers then incubated
poacic acid with purified fungal β1,3 glucan:
the glucan particles began to fluoresce,
confirming the interaction.

Given these promising results, the researchers 
next tested the potency of poacic acid against 
key agricultural fungal pathogens. Pre- 
liminary studies showed that poacic acid 
inhibited the fungi Sclerotinia sclerotiorum 
and Alternaria solani in a dose-dependent 
manner and protected soya-bean leaves 
from the oomycete (fungus-like) pathogen 
Phytophthora sojae, suggesting that poacic
acid has broad-spectrum activity.

Although the ability of poacic acid to target 
β1,3 glucan and to disrupt cell walls in yeast is
clear, more work is needed to establish the extent 
to which these effects are mirrored in agricul- 
tural pathogens and to gain deeper insights into 
the biochemical mechanism of action of poacic 
acid. Metabolomics (the study of metabolites 
within an organism, cell or tissue) provides a 
powerful tool with which to examine specific 
changes in the cellular biochemicals induced by 
drug action, and it has been successfully used to 
study pesticides. When applied to poacic acid
and other antifungals, metabolomics may help 
us to define the specific enzymatic targets that 
underlie fungicidal action. This could lead to 
the discovery of synergistic drug pairings and 
might help to explain why some fungicides are 
toxic to humans.

Piotrowski and colleagues’ demonstration 
of synergistic effects between poacic acid and 
human antifungal drugs is promising, suggest- 
ing that poacic acid and derivatives might have 
roles in medicine. But although poacic acid is 
probably also synergistic with agricultural fun- 
gicides, this remains to be examined and will be 
a crucial part of future studies. Finally, before 
poacic acid can be translated into a market- 
ready fungicide there are still major hurdles 
to be overcome, including testing its effective- 
ness in field trials, efficacy against more-diverse 
pathogens and persistence in the environment, 
and establishing whether it is safe for humans. 

Although much work is still needed, this 
study is a timely illustration of how the right 
biological techniques, combined with resource- 
fulness — in this case discovering fungicides in 
waste biomass — can be used to gain insights 
into drug action. The work also shows how 
understanding drug mechanisms can help to 
identify synergistic drug pairings. This strat- 
egy holds great potential for rationally designed 
combination therapies that could extend the 
potency and lifetimes of existing fungicides. On 
its own, in combination with other fungicides 
or as a lead compound for further development,
poacic acid is a potentially valuable new weapon 
in our armamentarium of antifungals.
Steps on the road
to eukaryotes 


A new archaeal phylum represents the closest known relatives of eukaryotes, the
group encompassing all organisms that have nucleated cells. The discovery holds 
promise for a better understanding of eukaryotic origins. SEE ARTICLE P.173 


T. MARTIN EMBLEY & TOM A. WILLIAMS 


There are many competing hypotheses
for how eukaryotic cells, which
contain a nucleus and other mem- 
brane-bound organelles, evolved from their 
prokaryotic ancestors, whose cells lack a 
nucleus. But testing these theories has been 
difficult owing to a lack of known interme- 
diate stages in the prokaryote-to-eukaryote 
transition. On page 173 of this issue, Spang 
et al. describe a prokaryotic lineage that is
more closely related to eukaryotes than any 
yet sampled and that shares with eukaryotes 
several genes previously thought to define 
aspects of eukaryotic biology. This techni- 
cally outstanding paper has far-reaching 
implications for how we view early eukaryotic 
evolution, including our own deep ancestry. 
In most textbooks the cellular world is 
divided into three domains: the eukaryotes
(Eukarya) and two distinct prokaryotic
groups, the Bacteria and the Archaea. In the 
classical three-domains tree, the eukaryotes 
are separated from a common prokaryotic 
ancestor that they share with Archaea by a long
branch that has been variously interpreted as
representing a long period of time with 
unsampled diversity, a high rate of evolution 
in the eukaryotic ancestor, or the extinction 
of intermediate forms. Surveys of environ- 
mental microbial diversity using the tools of 
molecular biology have sought to populate this 
long branch, but have so far failed to identify 
any fundamentally new eukaryotic groups. In 
the three-domains tree, the eukaryotes appear 
fully formed, with almost all of the cellular 
complexity that we associate with modern 
eukaryotes already in place.

The three-domains tree is the most visible 
image depicting the diversity of cellular life, 
but it has not gone unchallenged. An alterna- 
tive two-domains tree, in which the eukaryotic 
lineage originated within the archaeal domain, 
has gathered support from recent phylogenetic 
analyses and is now arguably the favoured
hypothesis. In this tree, the eukaryotes are 
related to a diverse group of Archaea called the 
TACK superphylum. Thus, unlike the
three-domains tree, the two-domains tree includes
an explicit prediction about where we should 
look for closer relatives of the eukaryotic 
Figure 1 | Lokiarchaeota are the closest known prokaryotic relatives of eukaryotes. Phylogenetic trees
presented by Spang et al. place eukaryotes within the Lokiarchaeota, a new group of Archaea described
by the authors. The genomes of these Archaea contain more eukaryotic-like genes than other known
Archaea. This finding implies that some of the defining features of eukaryotes — including a cytoskeleton, 
membrane remodelling, ubiquitin modification and the capacity for endocytosis and/or phagocytosis — 
might have already evolved in the last common ancestor of eukaryotes and Archaea. Spang and colleagues’ 
findings suggest that it is likely that other new lineages will be found to further close the evolutionary gap 
between Archaea and eukaryotes, increasing the precision with which we can identify when key cellular 
innovations such as the nucleus, mitochondrion and endoplasmic reticulum first evolved. 
ancestral lineage. Spang et al. report the first
spectacular results of that search. 

The authors sequenced the combined 
genomes (metagenomes) from samples of 
marine sediments that had been enriched for 
members of the Deep-Sea Archaeal Group, a 
lineage related to the TACK Archaea that has 
not been cultured in the laboratory. Using 
cutting-edge computational methods, Spang 
et al. reconstructed one largely complete and 
two partial genomes from closely related 
members of the Deep-Sea Archaeal Group, 
for which they propose the new name Loki- 
archaeota. Using slowly evolving marker genes, 
the authors constructed phylogenetic trees that 
place eukaryotes within the Lokiarchaeota at 
the base of the TACK superphylum, suggesting 
that Lokiarchaeota are the closest prokaryotic 
relatives of eukaryotes yet discovered (Fig. 1). 

Consistent with the relationship implied by 
the authors’ trees, the Lokiarchaeum genome 
contains more eukaryotic signature genes than 
do other prokaryotes, including genes encod- 
ing actin proteins, components of a primordial 
vesicle-trafficking complex, a ubiquitin-modi- 
fier system and a diverse array of small GTPase 
enzymes belonging to the Ras superfamily. 
Making reliable evolutionary trees for ancient 
relationships is difficult, especially given
that the split between eukaryotes and the 
Lokiarchaeota may have occurred more than 
2 billion years ago. So it is this combination 
of enhanced ‘eukaryote-like’ genome content 
and the phylogeny that make the case for a 
close relationship between eukaryotes and 
Lokiarchaeota so convincing. 

In the absence of cultured Lokiarchaeota, 
the cellular manifestation of this enhanced 
protein repertoire can be only indirectly 
inferred. Nevertheless, the genome hints 
at an organism that has a dynamic, actin- 
based cytoskeleton, vesicular-trafficking and 
membrane-remodelling capabilities, and the 
potential for uptake of materials from the 
environment by endocytosis and/or phago- 
cytosis. These are all traits that could have 
enabled the common ancestor of all eukaryotes 
to engulf the bacterial symbiont that became 
the progenitor of the mitochondrion, a vital 
organelle of modern eukaryotes. The presence
of these genes in an otherwise unambiguously 
archaeal genome is consistent with hypotheses 
proposing that an archaeon was the host for 
that fundamental event in the development of 
eukaryotic cells.

The discovery of the Lokiarchaeota provides 
strong evidence that phylogenetics can be used 
to infer ancient relationships and, in combi- 
nation with single-cell and metagenomic 
sequencing, provides a powerful toolkit for 
testing ideas about the origins of the component
parts of eukaryotic cells. The same
techniques can be applied in future studies 
to pinpoint the source of the mitochondrial 
endosymbiont, and to investigate the origins 
of the many bacterial genes that comprise a 
major component of eukaryotic genomes. The
sequences of genes encoding ribosomal RNA 
molecules, which are commonly used for taxo- 
nomic classification, can help to identify envi- 
ronmental samples enriched in Lokiarchaeota 
and its relatives. This will facilitate the isolation 
and cultivation of such cells for detailed study 
of their biology and metabolism. 

The identification of Lokiarchaeota so early 
in the history of this nascent field suggests 
that more-closely related archaeal relatives of 
eukaryotes will soon be discovered. Some of 
these may be related to the Deep-Sea Archaeal 
Group, an enormously diverse and abundant
radiation of Archaea for which the Loki- 
archaeota genomes are the first available. The 
genomes and cellular features of these relatives 
may provide a more detailed picture of the most 
recent common ancestor of eukaryotes and 
Archaea, and may help to resolve the timing of 
the innovations that are used to define eukary- 
otes. Not least, the discovery of Lokiarchaeum, 
and the promise of further discoveries from 
the vast, unexplored world of prokaryotic 
diversity, raises the tantalizing prospect that
the investigation of eukaryotic origins can now 
enter the realm of testable science.
Rap and chirp about
X inactivation 


Two new techniques identify proteins that directly interact with a
non-protein-coding RNA called Xist to mediate inactivation of one X chromosome
in female mammals. SEE LETTER P.232


ANNA ROTH & SVEN DIEDERICHS 


In mammals, females have two X chromosomes,
whereas males have only one. To
ensure that the expression of genes from the
X chromosome is equal between the two sexes,
one X chromosome, dubbed Xi, is inactivated
in all female cells except eggs. X-chromosome
inactivation (XCI) involves the long non-coding
RNA (lncRNA) Xist, but only a few proteins
that directly bind to Xist have been identified.
The development of two techniques for isolating
RNA-bound proteins (one described on
page 232 of this issue and one published in
Cell) now sheds light on more Xist-interacting
proteins that have a role in XCI.

In 1961, the Lyon hypothesis posited that
XCI occurs through the repression of genes on
the Xi during early embryonic development
and that this state is maintained as cells divide.
Decades later, the identification of Xist, which
is expressed exclusively from the Xi, provided
an explanation for how such gene silencing
arises. During XCI initiation, the Xi becomes
coated in Xist, which establishes a repressive
environment in which the proteins involved
in transcription are displaced from the chro- 
mosome. Two protein complexes, Polycomb 
repressive complex 1 (PRC1) and PRC2 (ref. 7), 
are subsequently drawn to the Xi and modify 
the histone proteins around which DNA is 
packaged. These changes establish a stable state 
of transcriptional repression. Finally, spreading 
of Xist along the entire chromosome leads to 
silencing of genes across the Xi.

Despite progress in understanding XCI, 
many mechanistic details remain unclear 
because only a few of the partner proteins 
required for gene silencing by Xist have been 
found. The key to progress lies in identifying
other proteins that directly interact with Xist. 
As such, unbiased ‘pull-down’ techniques that
capture the proteins bound to a specific RNA 
are much sought after. 

McHugh et al. and Chu et al. present
highly sensitive techniques — named RNA 
antisense purification-mass spectrometry 
(RAP-MS) and comprehensive identification 
of RNA-binding proteins by mass spectro- 
metry (ChIRP-MS), respectively — that can 
Figure 1 | Capturing RNA-protein complexes. Two groups have
developed techniques, called comprehensive identification of RNA-binding
proteins by mass spectrometry (ChIRP-MS) and RNA antisense
purification—mass spectrometry (RAP-MS), for identifying the proteins that 
interact with specific RNAs in vivo. In RAP-MS, cells are grown in a protein- 
labelling solution called SILAC (stable isotope labelling by amino acids in 
culture) medium, which allows a quantitative comparison of the proteins that 
interact with the RNA of interest and with a control RNA. In both techniques, 
RNA-protein interactions are preserved by crosslinking, through either 


be used to identify the proteins that interact 
with Xist in vivo (Fig. 1). The methods both 
involve a crosslinking step that preserves 
naturally occurring RNA-protein complexes, 
which are then captured using DNA sequences 
(90 nucleotides long in RAP-MS, and 20 nucle- 
otides long in ChIRP-MS) that are complemen- 
tary to, and so bind to, the RNA sequence of 
interest. The DNA is tagged at one end with a 
biotin molecule, which interacts with a protein 
called streptavidin on magnetic beads. Only 
RNA-protein complexes bound to the biotin- 
tagged DNA are captured by the beads. Finally, 
the proteins are eluted from the beads and 
identified through mass spectrometry. 
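
The capture step relies on base pairing between the probe and its target. As a rough illustration, the sketch below builds DNA probes that are complementary to an RNA by taking the reverse complement of successive stretches of the sequence. The target sequence is made up (it is not Xist), and tiling the probes end to end without overlap is an assumption made for simplicity; real probe design also screens for off-target matches.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}

def antisense_dna(rna):
    """DNA strand that base-pairs with `rna` (reverse complement, written 5' to 3')."""
    return "".join(COMPLEMENT[base] for base in reversed(rna.upper()))

def tile_probes(rna, probe_length):
    """Non-overlapping antisense probes covering `rna`; a short remainder is dropped."""
    return [antisense_dna(rna[start:start + probe_length])
            for start in range(0, len(rna) - probe_length + 1, probe_length)]

# Made-up 46-nucleotide RNA used only to exercise the functions.
target = "AUGGCUAACGUUAGCCAUGGAUCCGAUUACGGAUCGAUCGAUUAGC"
probes = tile_probes(target, 20)   # 20-nucleotide probes, as in ChIRP-MS
for probe in probes:
    print(probe)
```

Passing `probe_length=90` instead would mimic the longer probes described for RAP-MS, given a target long enough to hold them.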

A key difference between the two protocols 
is that in RAP-MS, cells are grown in SILAC 
(stable isotope labelling by amino acids in cul- 
ture) medium, which contains the amino acids 
lysine and arginine, labelled with either heavy 
or light isotopes. Proteins are then labelled 
with heavy or light isotopes depending on the 
medium used, allowing a quantitative com- 
parison of the proteins pulled down with Xist 
compared to those pulled down with a control 
RNA molecule. By contrast, ChIRP-MS uses 
a standard culture medium. 
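
The quantitative comparison that SILAC permits boils down to an intensity ratio per protein. The sketch below is a simplified illustration, not the published analysis: it assumes that the heavy label marks the pull-down of the RNA of interest and the light label marks the control, and the protein names, intensities and twofold cut-off are all invented.

```python
def silac_enrichment(heavy, light, min_ratio=2.0):
    """Proteins whose heavy/light intensity ratio is at least `min_ratio`.

    `heavy` holds intensities from the pull-down of the RNA of interest and
    `light` those from the control pull-down (labelling assumed for illustration).
    """
    enriched = {}
    for protein, heavy_signal in heavy.items():
        light_signal = light.get(protein, 0.0)
        if light_signal == 0.0:
            ratio = float("inf")   # detected only with the RNA of interest
        else:
            ratio = heavy_signal / light_signal
        if ratio >= min_ratio:
            enriched[protein] = ratio
    return enriched

# Invented intensities for three placeholder proteins.
heavy = {"protein_A": 900.0, "protein_B": 120.0, "protein_C": 50.0}
light = {"protein_A": 100.0, "protein_B": 110.0}
hits = silac_enrichment(heavy, light)
print(sorted(hits))
```

Proteins that bind the beads or probes non-specifically appear at similar levels in both samples and so fall below the cut-off.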

Using ChIRP-MS, Chu and colleagues 
captured 81 Xist-bound proteins in four differ- 
ent types of mouse embryonic stem cell — one 
type that has the potential to differentiate into 
all bodily cell types, and three that are prepar- 
ing to differentiate into more-specialized cells. 
Although most proteins bound Xist in all four 
types of cell, 19 were not found in the most 
primitive cell type. The two sets of proteins 
might therefore reflect stepwise changes in Xist 
binding patterns that arise as embryonic stem 
cells begin to differentiate. 

The ‘repeat A’ region is an evolutionarily
conserved RNA sequence close to the 5′ end of
Xist that is implicated in transcriptional silencing.
Chu et al. found that Xist molecules that
lack the repeat A region cannot bind to three 
of the proteins that are pulled down by the full-
length RNA. One such protein is the transcrip- 
tional repressor Spen (also known as Sharp). 
The authors found that Spen directly interacts 
with the repeat A region in vivo and in vitro. 
Furthermore, they demonstrate that Spen and 
two other proteins, HnrnpK and HnrnpU, are 
involved in gene silencing during XCI, and pro- 
vide evidence to suggest that HnrnpK might 
mediate PRC2 recruitment to the Xi. 

McHugh et al. used RAP-MS to identify 
ten proteins that directly bind to Xist, three of 
which — Spen, Lbr and HnrnpU — are nec- 
essary for silencing of genes on the Xi. The 
authors report that Spen represses transcrip- 
tion in the early stages of XCI, by binding to 
its co-repressor protein, Smrt (ref. 10), which 
activates Hdac3 (ref. 11). This enzyme removes 
acetyl groups from histones, thereby promoting 
transcriptional repression. The authors propose 
that Xist-PRC2 interactions are independent of 
the repeat A region but are indirectly mediated 
by the Spen-Smrt-Hdac3 complex through an 
as-yet-unknown mechanism. 

The development of techniques for analysing 
RNA-protein interactions is instrumental 
to the success of both of the current studies. 
Impressively, nine out of ten proteins iden- 
tified by RAP-MS are also pulled down by 
ChIRP-MS, demonstrating the specificity of 
both techniques. The difference in the number 
of Xist-interacting proteins identified might 
arise from differences in methodology, the 
cell models and control RNAs used or the 
thresholds used to determine whether an inter- 
action is significant. Each group validated its 
method using cellular non-coding RNAs with 
well-characterized binding proteins, rather 
than lncRNAs. Future studies should test the 
efficiency of the techniques using exten- 
sively studied and abundant lncRNAs, such 
as MALAT1 (ref. 12), and, most importantly, 
using less-abundant lncRNAs, because most 
lncRNAs exist in low numbers in cells. 
ultraviolet irradiation or formaldehyde treatment. Cells are homogenized, and 
the RNA-protein complexes within are bound by DNA probes — strands of 90 
or 20 nucleotides (nt) that are complementary to the RNA of interest, tagged 
with a biotin molecule at one end. The complexes are then captured on beads 
coated in streptavidin protein, which interacts strongly with biotin. In RAP- 
MS, the proteins are eluted from the beads by degradation (digestion) of the 
RNA and DNA. By contrast, in ChIRP-MS addition of extra biotin molecules 
interferes with biotin-streptavidin binding to displace the complexes. Finally, 
proteins are identified by mass spectrometry. 


Both studies provide some mechanistic 
insight into XCI. Neither work captured 
components of PRC2, supporting a model in 
which Xist-PRC2 interactions are indirectly 
mediated by HnrnpK and Spen or by PRC1. 
Alternatively, different isoforms of Xist might 
interact with different proteins. Defining the 
roles of other interaction partners in Xist- 
mediated XCI will shed more light on the 
complex mechanisms underlying this process. 

More than 10,000 lncRNAs have now been 
discovered — some of which have vital roles 
in health and disease. However, the cellular 
functions and molecular mechanisms of most 
lncRNAs are unknown. ChIRP-MS and RAP- 
MS are powerful and versatile techniques that 
have the potential to boost RNA research by 
reliably identifying the proteins that interact 
with any specific RNA sequence in vivo. Nota- 
bly, they may enable the identification of entire 
lncRNA-binding networks, including proteins, 
DNA and RNA, a key step towards elucidat- 
ing the pathways in which lncRNAs function. 
Complex archaea that bridge the gap 
between prokaryotes and eukaryotes 


Anja Spang, Jimmy H. Saw, Steffen L. Jørgensen, Katarzyna Zaremba-Niedzwiedzka, Joran Martijn, Anders E. Lind, 
Roel van Eijk, Christa Schleper, Lionel Guy & Thijs J. G. Ettema 


The origin of the eukaryotic cell remains one of the most contentious puzzles in modern biology. Recent studies 
have provided support for the emergence of the eukaryotic host cell from within the archaeal domain of life, but 
the identity and nature of the putative archaeal ancestor remain a subject of debate. Here we describe the discovery 
of ‘Lokiarchaeota’, a novel candidate archaeal phylum, which forms a monophyletic group with eukaryotes in 
phylogenomic analyses, and whose genomes encode an expanded repertoire of eukaryotic signature proteins that are 
suggestive of sophisticated membrane remodelling capabilities. Our results provide strong support for hypotheses in 
which the eukaryotic host evolved from a bona fide archaeon, and demonstrate that many components that underpin 
eukaryote-specific features were already present in that ancestor. This provided the host with a rich genomic 
‘starter-kit’ to support the increase in the cellular and genomic complexity that is characteristic of eukaryotes. 


Cellular life is currently classified into three domains: Bacteria, 
Archaea and Eukarya. Whereas the cytological properties of 
Bacteria and Archaea are relatively simple, eukaryotes are character- 
ized by a high degree of cellular complexity, which is hard to reconcile 
given that most hypotheses assume a prokaryote-to-eukaryote trans- 
ition. In this context, it seems particularly difficult to account for the 
suggested presence of the endomembrane system, the nuclear pores, 
the spliceosome, the ubiquitin protein degradation system, the RNAi 
machinery, the cytoskeletal motors and the phagocytotic machinery 
in the last eukaryotic common ancestor (ref. 3 and references therein). 
Ever since the recognition of the archaeal domain of life by Carl Woese 
and co-workers, Archaea have featured prominently in hypotheses 
for the origin of eukaryotes, as eukaryotes and Archaea represented 
sister lineages in Woese’s ‘universal tree’. The evolutionary link 
between Archaea and eukaryotes was further reinforced through stud- 
ies of the transcription machinery and the first archaeal genomes, 
revealing that many genes, including the core of the genetic informa- 
tion-processing machineries of Archaea, were more similar to those of 
eukaryotes than to those of Bacteria. During the early stages of the 
genomic era, it also became apparent that eukaryotic genomes were 
chimaeric by nature, comprising genes of both archaeal and bacterial 
origin, in addition to genes specific to eukaryotes. Yet, whereas many of 
the bacterial genes could be traced back to the alphaproteobacterial 
progenitor of mitochondria, the nature of the lineage from which the 
eukaryotic host evolved remained obscure. This lineage might 
either descend from a common ancestor shared with Archaea (follow- 
ing Woese’s classical three-domains-of-life tree), or have emerged 
from within the archaeal domain (so-called archaeal host or eocyte-like 
scenarios). Recent phylogenetic analyses of universal protein data 
sets have provided increasing support for models in which eukaryotes 
emerge as sister to or from within the archaeal ‘TACK’ superphylum, 
a clade originally comprising the archaeal phyla Thaumarchaeota, 
Aigarchaeota, Crenarchaeota and Korarchaeota. In support of this 
relationship, comparative genomics analyses have revealed several 
eukaryotic signature proteins (ESPs) in TACK lineages, including dis- 


tant archaeal homologues of actin and tubulin, archaeal cell division 
proteins related to the eukaryotic endosomal sorting complexes 
required for transport (ESCRT)-III complex, and several informa- 
tion-processing proteins involved in transcription and translation. 
These findings suggest an archaeal ancestor of eukaryotes that might 
have been more complex than the archaeal lineages identified thus 
far. Yet, the absence of missing links in the prokaryote-to-eukaryote 
transition currently precludes detailed predictions about the nature and 
timing of events that have driven the process of eukaryogenesis. 
Here we describe the discovery of a new archaeal lineage related to the 
TACK superphylum that represents the nearest relative of eukaryotes in 
phylogenomic analyses, and intriguingly, its genome encodes many 
eukaryote-specific features, providing a unique insight into the emergence 
of cellular complexity in eukaryotes. 


Genomic exploration of new TACK archaea 


While surveying microbial diversity in deep marine sediments influ- 
enced by hydrothermal activity from the Arctic Mid-Ocean Ridge, 16S 
rRNA gene sequences belonging to uncultivated archaeal candidate 
lineages were identified in a gravity core (GC14) sampled approximately 
15 km north-northwest of the active venting site Loki’s Castle at 
3283 m below sea level (73.763167 N, 8.464000 E) (Fig. 1a). 
Subsequent phylogenetic analyses of these sequences, which comprised 
~10% of the obtained 16S reads, revealed that they belonged to the 
gamma clade of the Deep-Sea Archaeal Group/Marine Benthic Group B 
(hereafter referred to as DSAG) (Fig. 1b-d and Supplementary 
Figs 1 and 2), a clade proposed to be deeply branching in the TACK 
superphylum. DSAG constitutes one of the most abundant and widely 
distributed archaeal groups in the deep marine biosphere, but so far 
none of its representatives have been cultured or sequenced. 

To obtain genomic information for this archaeal lineage, we applied 
deep metagenomic sequencing to the GC14 sediment sample, resulting 
in a smaller (LCGC14, 8.6 Gbp) and a larger, multiple-strand displace- 
ment amplified (MDA) metagenome data set (LCGC14AMP, 56.6 Gbp; 
Fig. 2a; Supplementary Fig. 3 and Supplementary Table 1). Given the 
Figure 1 | Identification of a novel archaeal 
lineage. a, Bathymetric map of the sampling site 
(GC14; red circle) at the Arctic Mid-Ocean 
Spreading Ridge, located 15 km from Loki’s Castle 
active vent site. b, 16S rRNA amplicon-based 
assessment of microbial diversity in GC14. Bars on 
the left represent the fraction of the respective 
prokaryotic taxa and bars on the right depict 
archaeal diversity. Numbers refer to operational 
taxonomic units for each group. MHVG, Marine 
Hydrothermal Vent Group; DHVEG-6, Deep-sea 
deeper coverage, the latter data set was used to extract marker genes that 
carry an evolutionarily coherent phylogenetic signal (Supplementary 
Tables 2 and 3). Using single-gene phylogenies of these markers, contigs 
attributable to any one of the archaeal lineages present in the 
LCGC14AMP metagenome (DSAG, DSAG-related, DPANN and 
Thaumarchaeota) could be extracted. These taxon-specific contigs were 
used as training sets for supervised binning of contigs present in both the 
LCGC14 and LCGC14AMP metagenomes (Supplementary Fig. 4). This 
approach resulted in the identification of two DSAG bins (from 
LCGC14 and LCGC14AMP, respectively) as well as one DSAG-related 
bin (bin Loki2/3 from LCGC14AMP). We focused on the DSAG bin 
from the non-amplified data set to avoid potential biases introduced by 
MDA (see Methods). The analyses of the low-abundance DSAG-related 
lineages were based on the MDA-amplified LCGC14AMP data set. 

After removal of small (<1 kbp) and low-coverage contigs (Supple- 
mentary Fig. 5), reads mapping to the remaining DSAG bin contigs were 
reassembled into 504 contigs, yielding a 92% complete, 1.4-fold redund- 
ant composite genome (‘Lokiarchaeum’) of 5.1 Mbp, which encodes 
5,381 protein-coding genes as well as single copies of the 16S and 23S 
rRNA genes (Supplementary Table 4 and Supplementary Discussion 1). 
The DSAG-related bin (Loki2/3 from LCGC14AMP) was found to 
contain two low-abundance, distinct lineages, displaying slight but 
marked differences in GC content of 32.8 and 29.9%, allowing for 
separation into two distinct groups (Loki2 and Loki3) (Supple- 
mentary Fig. 6). Since these two lineages represent low-abundance 
community members, only partial genomes could be recovered. The 
Loki2/3 contigs did not contain 16S rRNA genes, rendering it imposs- 
ible to attribute them to any of the uncultured archaeal 16S phylotypes 
identified in the GC14 sediments, such as the low-abundance Marine 
Hydrothermal Vent Group archaea (abundance ~0.05%; Fig. 1c). 
However, phylogenetic marker genes were extracted for these lineages 
as well (21 and 34 markers for Loki2 and Loki3, respectively) since their 
inclusion was potentially useful in resolving the phylogenetic placement 
of the Lokiarchaeum lineage. 
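
The contig-level filtering and the GC-based separation described above can be sketched in Python as follows. This is a simplified illustration and not the binning procedure used in the study (which relied on supervised binning; see Methods): the 1-kbp length cut-off and the two GC values come from the text, whereas the coverage threshold, the group labels and the toy sequences are assumptions.

```python
def gc_content(seq):
    """Percentage of G and C among the unambiguous bases of a sequence."""
    seq = seq.upper()
    gc = sum(1 for base in seq if base in "GC")
    acgt = sum(1 for base in seq if base in "ACGT")
    return 100.0 * gc / acgt if acgt else 0.0

def filter_contigs(contigs, min_length=1000, min_coverage=5.0):
    """Drop contigs shorter than min_length bp or below min_coverage.

    contigs: dict mapping contig name -> (sequence, mean read coverage).
    min_coverage is a hypothetical threshold chosen for illustration.
    """
    return {
        name: (seq, cov)
        for name, (seq, cov) in contigs.items()
        if len(seq) >= min_length and cov >= min_coverage
    }

def assign_by_gc(seq, group_gc):
    """Assign a contig to the group whose mean GC content is closest."""
    gc = gc_content(seq)
    return min(group_gc, key=lambda group: abs(group_gc[group] - gc))

# Mean GC content (%) of the two lineages; the labels are placeholders.
GROUP_GC = {"group_a": 32.8, "group_b": 29.9}

# Toy contigs: (sequence, coverage).
contigs = {
    "c1": ("A" * 700 + "G" * 300, 20.0),   # 1 kbp, 30.0% GC
    "c2": ("A" * 670 + "C" * 330, 18.0),   # 1 kbp, 33.0% GC
    "c3": ("ACGT" * 100, 25.0),            # too short (400 bp)
    "c4": ("A" * 1000 + "G" * 500, 1.0),   # coverage too low
}

kept = filter_contigs(contigs)
groups = {name: assign_by_gc(seq, GROUP_GC) for name, (seq, _cov) in kept.items()}
print(groups)
```

A difference of about three percentage points in GC content is small, so in practice such a split would be combined with other signals, such as read coverage or sequence composition profiles.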


Lokiarchaeota and Eukarya are monophyletic 


To determine the phylogenetic affiliation of Lokiarchaeum and the 
Loki2/Loki3 lineages, maximum-likelihood and Bayesian inference 
phylogenetic analyses were performed, using sophisticated models of 
molecular sequence evolution. By relaxing the assumption of 
homogeneous amino acid composition across sites or across 
branches of the tree, these models are less sensitive to long-branch 
attraction and other phylogenetic artefacts. Both maximum-likelihood 
and Bayesian inference analyses of concatenated alignments compris- 
ing 36 conserved phylogenetic marker proteins (Supplementary 
Tables 2 and 3) revealed that the DSAG and DSAG-related archaea 
(hereafter referred to as ‘Lokiarchaeota’) represent a monophyletic, 
deeply branching clade of the TACK superphylum. Loki3 represented 
the deepest branch of the Lokiarchaeota, and Lokiarchaeum and 
Loki2 were inferred to be sister lineages with maximum support 
(Supplementary Fig. 7). Intriguingly, when eukaryotes were included 
in our phylogenetic analyses, they were confidently positioned within 
the Lokiarchaeota (posterior probability = 1; bootstrap support = 80; 
Fig. 2b; Supplementary Figs 8 and 9), as the sister group of the 
Loki3 lineage (Fig. 2b). Robust assessment of these phylogenetic 
inferences (Supplementary Figs 10-14 and Supplementary Table 5) 
revealed strong support for the Lokiarchaeota–Eukarya affiliation 
(Supplementary Discussion 2). 

The proposed naming of the Eukarya-affiliated candidate phylum 
Lokiarchaeota and the Lokiarchaeum lineage is made in reference to the 
sampling location, Loki’s Castle, which in turn was named after the 
shape-shifting deity Loki of Norse mythology. Loki has been described 
as “a staggeringly complex, confusing, and ambivalent figure who has 
been the catalyst of countless unresolved scholarly controversies”, in 
analogy to the ongoing debates on the origin of eukaryotes. 


Presence of diverse and abundant ESPs 

As our phylogenetic analyses strongly support a common ancestry of 
Lokiarchaeota and eukaryotes, we investigated the presence of putat- 
ive ESPs in the composite Lokiarchaeum genome. The amount of 
genomic data obtained for the Loki2/3 lineages was too low to per- 
form detailed gene content analyses. A comparative taxonomic 
assessment of the Lokiarchaeum composite proteome revealed that 
a large fraction (32%) of its proteins displayed no significant similarity 
to any known protein, and that roughly as many proteins display 
highest similarity to archaeal and bacterial proteins (26% and 29%, 
Figure 2 | Metagenomic reconstruction and phylogenetic analysis of 
Lokiarchaeum. a, Schematic overview of the metagenomics approach. BI, 
Bayesian inference; ML, maximum likelihood. b, Bayesian phylogeny of 
concatenated alignments comprising 36 conserved phylogenetic marker proteins 
using sophisticated models of protein evolution (Methods), showing eukaryotes 
branching within Lokiarchaeota. Numbers above and below branches refer to 
Bayesian posterior probability and maximum-likelihood bootstrap support 
values, respectively. Posterior probability values above 0.7 and bootstrap support 
values above 70 are shown. Scale indicates the number of substitutions per site. 
c, Phylogenetic breakdown of the Lokiarchaeum proteome, in comparison with 
proteomes of Korarchaeota, Aigarchaeota (Caldiarchaeum) and Miscellaneous 
Crenarchaeota Group (MCG) archaea. Category ‘Other’ contains proteins 
assigned to the root of cellular organisms, to viruses and to unclassified proteins. 


respectively; Fig. 2c and Supplementary Fig. 15), which is in accord- 
ance with recent findings that suggest major inter-domain gene 
exchange between Bacteria and Archaea (Supplementary 
Discussion 3). Most notably, a significant part of the predicted 
proteome (175 proteins or 3.3%) was most similar to eukaryotic pro- 
teins (Fig. 2c) and revealed a dominance of proteins that, in eukar- 
yotes, are involved in membrane deformation and cell shape formation 
processes, including phagocytosis (Extended Data Table 1 and 
Supplementary Table 6). Several lines of evidence indicate that the 
presence of these proteins is not the result of contaminating 
eukaryotic sequence data. First, genes encoding Lokiarchaeum ESPs 
and other proteins most similar to eukaryotes were always flanked by 
prokaryotic genes (Supplementary Fig. 16), and most were encoded by 
contigs that also contained archaeal signature genes. Second, ESP- 
encoding contigs displayed high (>20×) read coverage, while eukar- 
yotic sequences could not be detected in the LCGC14 data set, and 
represented only a negligible fraction of the LCGC14AMP metagen- 
ome. Furthermore, the amplicon data generated with universal 16S/ 
18S primers did not reveal any 18S rRNA genes of eukaryotic origin 
(Fig. 1b). Third, phylogenetic analyses of several Lokiarchaeal ESPs 
revealed their emergence at the base of eukaryotic clades (see below), 
indicating that these proteins represent archaeal out-groups of the 
eukaryotic proteins rather than being truly eukaryotic in origin. 
Fourth, Lokiarchaeum appears to contain bona fide archaeal informa- 
tional processing machineries (Supplementary Discussion 4 and 
Supplementary Tables 7-9) and, irrespective of the significant amount 
of ESPs in its genome, lacks many other key eukaryotic features. Finally, 
we could also identify highly similar homologues of the Lokiarchaeal 
ESPs in a recent and independently generated marine sediment meta- 
genome derived from a sediment core sample off the Shimokita 
Peninsula of Japan, in which DSAG comprises a significant part of 
the microbial community. As the function and evolution of the 
Lokiarchaeal ESPs hold relevance for understanding the origin of the 
eukaryotic cell, we review some of the key findings in more detail below. 
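
The proteome fractions quoted in this section follow from simple counting, which can be sketched in Python as below. The figure of 175 eukaryote-like proteins and the total of 5,381 predicted proteins are taken from the text; the function names and the five-protein toy table of best-hit assignments are hypothetical and serve only to show the tally.

```python
from collections import Counter

def percent(part, whole):
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100.0 * part / whole, 1)

def taxonomic_breakdown(best_hits):
    """Percentage of proteins per best-hit category.

    best_hits: dict mapping protein id -> category of its most similar
    database sequence (for example 'Archaea', 'Bacteria', 'Eukarya', 'none').
    """
    counts = Counter(best_hits.values())
    total = len(best_hits)
    return {label: percent(n, total) for label, n in counts.items()}

# Values stated in the text: 175 of 5,381 predicted proteins.
eukaryote_like = percent(175, 5381)

# Toy best-hit table (invented) to illustrate the breakdown.
toy_hits = {
    "p1": "Archaea",
    "p2": "Bacteria",
    "p3": "Bacteria",
    "p4": "none",
    "p5": "Eukarya",
}
toy_breakdown = taxonomic_breakdown(toy_hits)
print(eukaryote_like, toy_breakdown)
```

The first value reproduces the 3.3% stated above; the toy table only demonstrates how per-category percentages such as those in Fig. 2c could be derived from best-hit assignments.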


Potential dynamic actin cytoskeleton 


Actins represent key structural proteins of eukaryotic cells and com- 
prise filaments that are crucial for various cellular processes, including 
cell division, motility, vesicle trafficking and phagocytosis. The 
Lokiarchaeum genome encodes five actin homologues that display 
higher similarity to eukaryotic actins and actin-related proteins 
(ARPs) than to crenactins, a group of archaeal actin homologues that 
were recently shown to be involved in cell shape formation 
(Supplementary Table 6). This observation was confirmed in a phylo- 
genetic analysis of the Lokiarchaeal actins that also included homo- 
logues identified in a recently published marine sediment 
metagenome (up to 99% identity) as well as in the LCGC14 and 
LCGC14AMP metagenomes (Fig. 3a and Supplementary Fig. 17). 
Lokiarchaeal actins (‘Lokiactins’) comprise several distinct clusters, 
some of which branch at the base of distinctive eukaryotic actin and 
ARP clusters, albeit with weak support (Fig. 3a). Despite the poor 
resolution of several deeper nodes in the actin tree, strong support 
is provided for a common ancestry of Lokiactins and eukaryotic 
actins, indicating that the proliferation of actins already occurred in 
the archaeal ancestor of eukaryotes. Notably, the Lokiarchaeum gen- 
ome also encodes several hypothetical short proteins containing gel- 
solin-like domains that so far appear to be absent from bacterial and 
any other archaeal genomes (Extended Data Table 1, Supplementary 
Tables 6 and 10 and Supplementary Discussion 5). In eukaryotes, 
these protein domains are part of the villin/gelsolin superfamily of 
proteins, which comprise various key regulators of actin filament 
assembly and disassembly. Although the function of these hypothet- 
ical gelsolin-domain proteins remains to be elucidated, it is tempting 
to speculate that Lokiarchaeum has a dynamic actin cytoskeleton. 


Genomic expansion of small GTPases 

Small GTPases belonging to the Ras superfamily comprise one of the 
largest protein families in eukaryotes, where they are involved in various 
regulatory processes, including cytoskeleton remodelling, signal trans- 
duction, nucleocytoplasmic transport and vesicular trafficking. Being 
Figure 3 | Identification and phylogeny of small GTPases and actin 
orthologues. a, Maximum-likelihood phylogeny of 378 aligned amino acid 
residues of actin homologues identified in Lokiarchaeum and in the 
LCGC14AMP metagenome, including eukaryotic actins, ARP1-3 homologues 
and crenactins. Consecutive numbers in brackets refer to the number of 
sequences in a respective clade from LCGC14AMP and Lokiarchaeum, 
respectively. b, Relative amount of small GTPases (assigned to IPR006689 and 
IPR001806) in the Lokiarchaeum genome in comparison with other eukaryotic, 


key regulators of actin cytoskeleton dynamics, these small GTPases 
represent essential components for the process of phagocytosis in 
eukaryotes. Intriguingly, the analysis of Lokiarchaeal ESPs revealed 
a multitude of Ras-superfamily GTPases, comprising nearly 2% of the 
Lokiarchaeal proteome (Fig. 3b). The relative amount of small 
GTPases in the Lokiarchaeum genome is comparable to that observed 
in several unicellular eukaryotes, only being surpassed by the protist 
Naegleria gruberi. In contrast, bacterial and archaeal genomes encode 
only few, if any, small GTPase homologues of the Ras superfamily 
(Fig. 3b). 

Phylogenetic analyses of the Lokiarchaeal small GTPases revealed 
that these represent several distinct clusters, each of which comprises 
several GTPase sequences (Fig. 3c and Supplementary Fig. 18). 
Although phylogenetic analyses failed to resolve most of the deeper 
nodes, several of the eukaryotic small GTPase families appear to share 
a common ancestry with Lokiarchaeal GTPases (Fig. 3c), suggesting 
an archaeal origin of specific subgroups of the eukaryotic small 
GTPases, followed by independent expansions in eukaryotes and 
Lokiarchaeota. This scenario contrasts with previous studies that have 
suggested that eukaryotic small GTPases were acquired from the 
alphaproteobacterial progenitor of mitochondria. 

Although genes encoding canonical eukaryotic GTPase-activating 
proteins (GAPs) were absent in Lokiarchaeota, twelve roadblock/ 
LC7-domain-containing proteins were identified (Supplementary 
Tables 6 and 10). While such proteins have been implicated in dynein 
organization in eukaryotes, roadblock/LC7 protein MglB of the bac- 
terium Myxococcus xanthus was shown to act as a GAP of the small 
archaeal and bacterial species. Numbers refer to total amount of small GTPases 
per predicted proteome. c, Maximum-likelihood phylogeny of 150 aligned 
amino acid residues of small Ras- and Arf-type GTPases (IPR006689 and 
GTPase MglA. Hence, the Lokiarchaeal roadblock/LC7 proteins 
represent possible candidates for alternative GAPs in this archaeon. 


Presence of a primordial ESCRT complex 


In eukaryotes, the ESCRT machinery represents an essential com- 
ponent of the multivesicular endosome pathway for lysosomal 
degradation of damaged or superfluous proteins, and it plays a role 
in several budding processes including cytokinesis, autophagy and 
viral budding. The ESCRT machinery generally consists of the 
ESCRT-I–III subcomplexes, as well as associated subunits. The 
analysis of the Lokiarchaeum genome revealed the presence of an 
ESCRT gene cluster (Fig. 4a), as well as of several additional pro- 
teins homologous to components of the eukaryotic multivesicular 
endosome pathway. For instance, Lokiarchaeum encodes divergent 
SNF7 domain proteins of the eukaryotic ESCRT-III complex, which 
appear to represent members of the Vps2/Vps24/Vps46 and Vps20/ 
Vps32/Vps60 families, respectively. A phylogenetic analysis of the 
Lokiarchaeal SNF7 domain proteins revealed that these branch at 
the base of these two eukaryotic ESCRT-III families with low boot- 
strap support (Fig. 4b and Supplementary Fig. 19), not only indi- 
cating that they might represent ancestral SNF7 copies, but also 
suggesting that the last eukaryotic common ancestor already inher- 
ited two divergent SNF7-domain-encoding genes from its putative 
archaeal ancestor rather than a single gene. Furthermore, the gene 
cluster encodes an ATPase that displays closest resemblance to 
eukaryotic VPS4-type ATPases, including katanin, membrane scaf- 
fold protein (MSP) and spastin (Fig. 4c and Supplementary Fig. 20) 
as well as hypothetical proteins that show significant similarity 
Figure 4 | Identification of ESCRT components in the Lokiarchaeum 
genome. a, Schematic overview of ESCRT gene clusters identified in Loki- 
archaeum and Loki2/3. Intensity of shading between homologous sequences is 
correlated with BLAST bit score. b, Maximum-likelihood phylogeny of 207 
aligned amino acid residues of ESCRT-III homologues identified in Loki- 
archaeum, LCGC14AMP and other archaeal lineages. Eukaryotic homologues 
include the two distantly related families Vps2/24/46 and Vps20/32/60. 
Bootstrap support values above 50 are shown. c, Maximum-likelihood 


to EAP30-domain-containing proteins (Vps36/22) and Vps25, 
respectively (Fig. 4a; Supplementary Figs 21 and 22). In eukaryotes, 
Vps22, Vps25 and Vps36 are components of the ESCRT-II com- 
plex, which comprises two to three of these proteins depending on 
the eukaryotic species. In addition, a protein domain analysis of 
the Lokiarchaeum proteome identified a Vps28-like protein, a com- 
ponent of the eukaryotic ESCRT-I subcomplex that links the ubi- 
quitin pathway to vesicular transport and which, apart from Vps28, 
comprises Vps23 and Vps37 (Extended Data Table 1 and 
Supplementary Fig. 23). The different subunits of the eukaryotic 
ESCRT-I complex share similar two-helix core domains and have 
been suggested to have evolved from a single ancestral sequence, 
which we now propose to be of archaeal origin. 

Finally, the Lokiarchaeum proteome was found to contain hypo- 
thetical proteins containing Longin-like domains, as well as several 
proteins belonging to the BAR/IMD superfamily (Supplementary 
Tables 6 and 10), comprising curvature sensing protein families 
involved in various aspects of vesicle/membrane trafficking or 
remodelling processes in eukaryotes. These findings suggest that 
Lokiarchaeum contains a primordial version of a eukaryotic ESCRT 
vesicle trafficking complex. In eukaryotes, ubiquitylation of target 
proteins represents a critical step in ESCRT-mediated protein degra- 
dation through the multivesicular endosome pathway. The 
Lokiarchaeum genome contains a gene cluster that encodes several 
components required for a functional ubiquitin modifier system, 
including homologues for ubiquitin-activating enzyme E1, ubiqui- 
tin-conjugating enzyme E2, and 26S proteasome regulatory subunit 
RPN11. In addition, several hypothetical proteins with ubiquitin-like 
domains were identified in Lokiarchaeum, as well as diverse zinc- 
phylogeny of 388 aligned amino acid residues of AAA-type Vps4 ATPases 
including representatives for each of the four major eukaryotic sub-groups 
(membrane scaffold protein (MSP), katanin, spastin/fidgetin and Vps4) as well 
as homologues identified in the Lokiarchaeum genome, in LCGC14AMP and 
in sequenced archaeal genomes. Bootstrap support values below 45 are not 
shown. b, c, Scale indicates the number of substitutions per site. Numbers in 
brackets refer to the number of sequences in the respective clades. 


finger/RING-domain-containing proteins, some of which might 
serve as candidates for E3 ubiquitin protein ligases (Supplementary 
Tables 6 and 10). Several of these components have also been iden- 
tified in Aigarchaeota. 


A ‘complex’ archaeal ancestor of Eukarya 


We have identified and characterized the genome of Lokiarchaeota, a 
novel, deeply rooting clade of the archaeal TACK superphylum, 
which in phylogenomic analyses of universal proteins forms a mono- 
phyletic group with eukaryotes. While the obtained phylogenomic 
resolution testifies to a deep archaeal ancestry of eukaryotes, the 
Lokiarchaeum genome content holds valuable clues about the nature 
of the archaeal ancestor of eukaryotes, and about the process of eukar- 
yogenesis. Many of the ESPs previously identified in different TACK 
lineages are united in Lokiarchaeum, indicating that the patchy 
distribution of ESPs amongst archaea is most likely the result of 
lineage-specific losses (Fig. 5). Moreover, the Lokiarchaeum genome 
significantly expands the total number of ESPs in Archaea, lending 
support to the observed phylogenetic affiliation of Lokiarchaeota and 
eukaryotes. Finally, and importantly, sequence-based functional pre- 
dictions for these new ESPs indicate a predominance of proteins that 
play pivotal roles in various membrane remodelling and vesicular 
trafficking processes in eukaryotes. It is also noteworthy that 
Lokiarchaeum appears to encode the most ‘eukaryotic-like’ ribosome 
identified in Archaea thus far (Supplementary Discussion 4), includ- 
ing a putative homologue of eukaryotic ribosomal protein L22e 
(Fig. 5; Supplementary Fig. 24 and Supplementary Tables 7 and 8). 
Taken together, our data indicate that the archaeal ancestor of eukar- 
yotes was even more complex than previously inferred and allow us to 
speculate on the timing and order of several key events in the process of 
eukaryogenesis. For example, the identification of archaeal genes 
involved in membrane remodelling and vesicular trafficking processes 
indicates that the emergence of cellular complexity was already under- 
way before the acquisition of the mitochondrial endosymbiont, which 
now appears to be a universal feature of all eukaryotes. Indeed, 
based upon our results it seems plausible that the archaeal ancestor of 
eukaryotes had a dynamic actin cytoskeleton and potentially endo- 
and/or phagocytic capabilities, which would have facilitated the inva- 
gination of the mitochondrial progenitor. 

The present identification and genomic characterization of a novel 
archaeal group that shares a common ancestry with eukaryotes indi- 
cates that the gap between prokaryotes and eukaryotes might, to some 
extent, be a result of poor sampling of the existing archaeal diversity. 
Environmental surveys have revealed the existence of a plethora of 
uncultured archaeal lineages, and some of these likely represent even 
closer relatives of eukaryotes. Excitingly, the genomic exploration of 
these archaeal lineages has now come within reach. Such endeavours, 
combined with prospective studies focusing on uncovering metabolic, 
chemical and cell biological properties of these lineages, will uncover 
further details about the identity and nature of the archaeal ancestor 
of eukaryotes, shedding new light on the evolutionary dark ages of the 
eukaryotic cell. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


No statistical methods were used to predetermine sample size. 

Sampling site and sample description. A 2-m long gravity core (GC14) was 
retrieved from the Arctic Mid-Ocean Ridge during summer 2010 (approximately 
15 km north-northwest of the active venting site Loki’s Castle; 3283 m below sea 
level; 73.763167 N, 8.464000 E) (Fig. 1a). Samples for geochemistry and micro- 
biology were collected immediately and either processed on board or frozen for 
later analysis. Upon port arrival, the core was stored in sealed core liners at 4 °C 
(core depository facility, University of Bergen, Norway). Comprehensive geo- 
chemical and microbial characteristics from this and adjacent sites have been 
described elsewhere. The core consists of hemipelagic-glaciomarine
sediments receiving episodic hydrothermal input. The oxygen penetration depth
was estimated at ~50 cm below sea floor (b.s.f.) and the organic carbon content
varied between 0.6 and 1.3%. While no methane or sulphide could be detected,
high and fluctuating levels of dissolved iron were measured.
The relative abundance of bacterial and archaeal 16S rRNA gene copy numbers
was previously estimated by quantitative PCR (qPCR), indicating a high
abundance of the DSAG in several of the investigated sediment horizons, especially at
75 cm b.s.f. (up to 40% of the total prokaryotic population; 2.7 10° copies per
gram sediment). Thus, sample material from the horizon at 75 cm b.s.f. was used for
all downstream analyses including amplicon and metagenome libraries. 

DNA extraction and genomic DNA amplification. To obtain sufficient 
amounts of genomic DNA for sequencing library preparation, new sample mater- 
ial was obtained from the 75-cm-b.s.f. layer of gravity core GC14 in summer 2013. 
After qPCR-based verification of high DSAG abundance in the re-sampled 
material, DNA was extracted from 7.5 g sediment using the FastDNA spin kit 
for soil in conjunction with the FastPrep-24 instrument (MP Biomedicals)
following the manufacturer’s protocol, except for the addition of polyadenosine as
described in ref. 53. The individual extractions were then pooled and
concentrated to a final volume of 50 µl using Amicon Ultra-0.5 filters (50,000 NMWL)
following the manufacturer’s protocol (Merck Millipore). Due to low yield and
presence of inhibitors, 2.73 ng of this genomic DNA was amplified using the 
REPLI-g ultrafast mini kit (Qiagen) according to the standard protocol for puri- 
fied genomic DNA. 

Amplicon sequencing and analysis of 16S rDNA phylogenetic analyses. To get 
a better estimate of the microbial diversity of Loki’s Castle sediment core 
LCGC14, the ‘universal’ primer pair A519F (5′-CAGCMGCCGCGGTAA-3′)
and U1391R (5′-ACGGGCGGTGWGTRC-3′) was used to amplify a
~900 bp fragment of the 16S rRNA genes present in the non-amplified genomic
community DNA (extracted from LCGC14, 75 cm b.s.f.) using the following
conditions: 15 min of heat activation of polymerase at 95 °C and 35 cycles of
95 °C (30 s), 54 °C (45 s), 72 °C (60 s), followed by final extension at 72 °C for
7 min. Qiagen HotStar Taq DNA polymerase was used for the PCR reactions. 
Subsequently, PCR products of the correct size were purified with Qiagen PCR 
purification kit, and quantified using a Nanodrop ND-3300 fluorospectrometer 
(Thermo Scientific). Clean PCR products were then used as input materials for 
library construction using TruSeq DNA LT Sample Prep Kit (Illumina) according 
to the manufacturer’s instructions and applied to sequencing with an Illumina 
MiSeq instrument. The Illumina MiSeq run produced 2 × 300-bp paired-end
reads. Raw MiSeq fastq sequences were treated with the Trimmomatic tool (v0.32)
using the following options: TRAILING:20, MINLEN:235 and CROP:235, to
remove trailing sequences below a phred quality score of 20 and to achieve
uniform sequence lengths for downstream clustering processes. Remaining traces
of Illumina adaptor sequences were removed by SeqPrep
(https://github.com/jstjohn/SeqPrep) and by BLAST searches against the NCBI
UniVec database.
Quality-filtered MiSeq reads were checked for correct orientation of the 16S
rRNA sequence in the paired-end reads, and those containing the forward primer
sequence (A519F) were extracted for OTU clustering with the UPARSE pipeline,
setting an OTU cut-off threshold of 97%. Chimeric sequences were filtered out by
the UCHIME tool integrated in the UPARSE pipeline. Remaining chimeric
sequences, if still present, were manually checked and removed. Abundances of
each OTU were calculated by mapping the chimaera-filtered OTUs against the
quality-filtered reads using the UPARSE pipeline. Using the mothur package
(v1.33.2), representative sequences for each OTU were aligned together with
the Silva NR99 release-115 alignment file to classify the OTUs.
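
The effect of the three trimming options can be illustrated with a minimal Python sketch. This is not Trimmomatic itself: the function name `trim_read` and the representation of qualities as a list of integers are assumptions made for illustration, and the steps are assumed to be applied in the order in which the options are listed above.

```python
def trim_read(seq, quals, trailing_q=20, min_len=235, crop_len=235):
    """Mimic TRAILING:20, MINLEN:235 and CROP:235 on a single read.

    seq   -- nucleotide string
    quals -- list of phred quality scores, one per base
    Returns (seq, quals) after trimming, or None if the read is dropped.
    """
    # TRAILING: remove bases from the 3' end while their quality is below the threshold.
    end = len(seq)
    while end > 0 and quals[end - 1] < trailing_q:
        end -= 1
    seq, quals = seq[:end], quals[:end]

    # MINLEN: drop reads that are now shorter than the minimum length.
    if len(seq) < min_len:
        return None

    # CROP: cut every surviving read to the same length from the 5' end.
    return seq[:crop_len], quals[:crop_len]
```

With these settings every retained read has exactly 235 bases, which is what makes the reads directly comparable during OTU clustering.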

Phylogenetic analysis of archaeal 16S rRNA gene sequences. Twenty-nine 
archaeal OTUs identified from the amplicon data were aligned together with 
220 sequences representing the major clades in the archaeal 16S rRNA tree 
according to the study by Durbin and Teske. A total of 249 sequences were
aligned with MAFFT L-INS-i (v7.012b), trimmed with TrimAl (v1.4), and
subjected to a maximum-likelihood phylogeny analysis using RAxML
(v8.0.22) (GTRGAMMA model of nucleotide substitution and 100 bootstraps).
The resulting tree was imported into iTOL online to collapse major clades.

Phylogenetic analysis of DSAG-related OTUs. All 16S rRNA gene sequences
classified as DSAG by Jørgensen et al. were used as queries in a BLAST search
(E<10 °, identity > 83%) against all archaeal entries in the SILVA database
(release 119) that met the following criteria: sequence length > 900 bp, alignment
identity > 70, alignment quality > 75 and pintail quality > 75. The quality of
recovered sequences was then checked (for example, using ‘cut-head’ and ‘cut-tail’
information). The number of sequences in the data set was reduced while keeping
maximum diversity as follows. First, the retained 16S rRNA sequences were
aligned with SINA (v1.2.11), using all archaea in the SILVA database as
reference. The alignment was manually curated with Seaview (v4). Upon removal
of gaps, sequences were used to create OTUs with UCLUST (v1.2.22) (94%
identity cut-off and the ‘-optimal’ option). All sequences that corresponded to
OTU seeds were selected to represent full DSAG genetic diversity and, upon
adding archaeal outgroup sequences and the single amplicon OTU classified
as DSAG, the final data set was aligned with SINA (v1.2.11) as described above,
trimmed with TrimAl (v1.4) (gap threshold of 50%) and subjected to RAxML
phylogenetic analyses (v7.2.8; GTRGAMMA substitution model, 100 rapid
bootstraps). All internal branches with ≤40 bootstrap support were collapsed with
Newick Utilities (v1.6). The resulting tree was then imported into iTOL online
to collapse major clades.

Metagenome sequencing and assembly. 

Library preparation and shotgun sequencing. Nextera libraries (Illumina) were 
prepared according to the manufacturer’s instructions, using unamplified 
LCGC14 (20 ng) and amplified LCGC14AMP (50 ng) as input DNA. Since less 
starting material was used for the generation of the unamplified library, a total of 
eight amplification cycles were used in the PCR step during which the Illumina 
barcodes and adapters (Nextera Index kit) were fused, rather than the default five
cycles. The LCGC14 and LCGC14AMP Nextera libraries were sequenced on
three and two lanes, respectively, of a HiSeq2500 (Illumina), using the rapid mode
setting, generating 2 × 150-bp paired-end reads. These runs yielded 8.6 Gbp and
56.6 Gbp of data with an average insert size of 620 and 350 bp for the LCGC14
and LCGC14AMP Nextera libraries, respectively.

Read preprocessing. SeqPrep (v.b5efabc5f7, https://github.com/jstjohn/SeqPrep) 
was used to merge overlapping paired-end reads and to trim adapters, with 
default settings. Merged reads and non-merged pairs were trimmed with Sickle 
(v.1.210, https://github.com/najoshi/sickle), using “se” and “pe” options, respect- 
ively, and default settings. 

Metagenomic assembly. Pre-processed paired-end reads and single reads were 
assembled with SPAdes v. 3.0.0 in single-cell mode, to take into account the
widely varying coverage of metagenomic contigs as well as to try to assemble
contigs with low coverage. The read correction tool was turned on and k-mer sizes
of 21, 33, 55 and 77 were used. Mismatch correction was not performed on the
LCGC14AMP data set. Contigs shorter than 1 kbp were discarded. 

Gene predictions. Protein coding genes (CDS) were identified with prodigal
v.2.60, using the ‘meta’ option for metagenomes. Ribosomal RNA (rRNA) genes
were called with rnammer v.1.2, using the archaeal model and searching for all
three rRNA subunits. Transfer RNA (tRNA) genes were identified with
tRNAscan-SE v.1.23, using the ‘-G’ option for metagenomes and the ‘-A’ option
for the Lokiarchaeum composite genome (see subsequent paragraphs). For the
latter, the analysis was also run with SPLITSX (no version number available;
source code downloaded on 14 August 2014) to detect tRNA genes that are
split or that have multiple introns. 

Protein clustering. Archaea-specific clusters of orthologous genes (arCOGs),
based on 120 archaeal proteomes (hereafter called arCOGs2012), were extended 
with proteomes from 45 recently sequenced organisms, including 31 single-cell 
amplified genomes (SAGs) (Supplementary Table 1). First, existing arCOGs were 
attributed to the new proteomes: protein sequences in each of the 10,323 
arCOGs2012 were aligned with MAFFT L-INS-i v.7.130b. Each alignment
was used as a query (-in_msa) to search the new proteomes using
PSI-BLAST, ignoring the master sequence, using 10° * as an E-value cut-off, fixing
the database size to 10°, gathering at most 1,000 sequences, and not using
composition-based statistics. Hits were then sorted per subject protein and, for each
subject, the highest-scoring query alignment was deemed the main arCOG. 
Whenever applicable, the next-highest, non-overlapping query alignment was 
deemed the secondary arCOG. Second, proteins without arCOG attribution (sin- 
gletons) in both the original and extended set of proteomes were gathered, and 
new arCOGs (arCOGs2014) were created from symmetrical best hits, using the 
tools available in COG software suite, release 201204 (ref. 75). PSI-BLAST 
searches were performed according to the COG software instructions. Lineage- 
specific expansions were identified with COGlse, using a job-description file 
containing all possible pairs of organisms that do not belong to the same phylum. 
COGtriangles was run with default settings, and yielded 3,570 new arCOGs. Of
the 325,405 proteins in the combined data sets (165 proteomes), 29,249 (9%) had 
no arCOG attribution. 

Attribution of arCOGs to metagenomes or composite genomes in this study 
was performed with PSI-BLAST as described above, using the arCOGs2014 as 
queries. 
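
The attribution rule described above (highest-scoring alignment as main arCOG, next-highest non-overlapping alignment as secondary arCOG) can be sketched in Python as follows. The `Hit` record, the function names and the arCOG identifiers in the usage example are hypothetical, and it is assumed here that overlap is judged on subject-protein coordinates with no tolerance; the original analysis may have treated overlaps differently.

```python
from collections import defaultdict
from typing import NamedTuple


class Hit(NamedTuple):
    subject: str   # protein being annotated
    arcog: str     # arCOG whose alignment was used as the query
    score: float   # PSI-BLAST score of the hit
    start: int     # first aligned position on the subject (inclusive)
    end: int       # last aligned position on the subject (inclusive)


def overlaps(a, b):
    """True if the two hits share at least one subject position."""
    return a.start <= b.end and b.start <= a.end


def attribute_arcogs(hits):
    """Return {subject: (main arCOG, secondary arCOG or None)}."""
    by_subject = defaultdict(list)
    for hit in hits:
        by_subject[hit.subject].append(hit)

    attribution = {}
    for subject, subject_hits in by_subject.items():
        ranked = sorted(subject_hits, key=lambda h: h.score, reverse=True)
        main = ranked[0]
        # Next-highest hit that does not overlap the main hit, if there is one.
        secondary = next((h for h in ranked[1:] if not overlaps(h, main)), None)
        attribution[subject] = (main.arcog, secondary.arcog if secondary else None)
    return attribution
```

Proteins that receive no hit at all never appear in the result; these correspond to the singletons that were subsequently clustered into new arCOGs.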

Phylogenetic analyses of ‘taxonomic marker’ proteins for binning and con- 
catenated protein trees. 

Phylogenetic inference. Maximum-likelihood phylogenies were inferred with 
RAxML 8.0.9, calculating 100 non-parametric bootstraps. PROTGAMMALG
and GTRGAMMA were used for amino acid and nucleotide alignments,
respectively, unless otherwise stated. Bayesian inference phylogenies were calculated
with PhyloBayes MPI 1.5a, using the CAT model and a GTR substitution
matrix. Four chains were run, and runs were checked for convergence.
Whenever convergence was not reached, the topology of individual chains was
compared. Consensus trees were obtained with bpcomp, using all four chains and
a burn-in of at least half the generations. To add bootstrap support values to the
Bayesian phylogenies, sumtrees.py (DendroPy package) was used, with default
settings, taking the Bayesian inference tree as a guide tree and the 100 bootstraps 
as input. For concatenated phylogenies, amino-acid sequences were aligned again 
with MAFFT L-INS-i individually for each cluster. Positions with >50% gaps 
were trimmed and alignments were concatenated. 
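
As a minimal illustration of the trimming and concatenation step, the following Python sketch removes columns with more than 50% gaps and joins per-cluster alignments into a supermatrix. The function names and the dictionary representation (taxon name to aligned sequence) are assumptions for illustration; filling missing taxa with gap characters is also an assumption, although it is a common convention.

```python
def trim_gappy_columns(alignment, max_gap_fraction=0.5):
    """Drop alignment columns whose gap fraction exceeds max_gap_fraction.

    alignment -- dict mapping taxon name to aligned sequence (equal lengths)
    """
    taxa = list(alignment)
    length = len(alignment[taxa[0]])
    keep = [
        i for i in range(length)
        if sum(alignment[t][i] == "-" for t in taxa) / len(taxa) <= max_gap_fraction
    ]
    return {t: "".join(alignment[t][i] for i in keep) for t in taxa}


def concatenate(alignments, taxa):
    """Concatenate per-cluster alignments; absent taxa are padded with gaps."""
    parts = {t: [] for t in taxa}
    for aln in alignments:
        length = len(next(iter(aln.values())))
        for t in taxa:
            parts[t].append(aln.get(t, "-" * length))
    return {t: "".join(chunks) for t, chunks in parts.items()}
```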

Amino acid bias filtering. To assess the effect of amino acid bias on the
phylogenies, a χ² filtering analysis was performed on the concatenated alignment. For a
complete description, see refs 78 and 79. In brief, a global χ² score is calculated for
the concatenated alignment, by summing, for each amino acid and each sequence,
the normalized squared difference between the observed frequency
of the amino acid in this particular sequence and its frequency expected from the
whole alignment. Each position in the alignment is individually trimmed and the
difference (Δχ²) between the global χ² score and the χ² score calculated on the
trimmed alignment provides an estimation of the relative contribution of each
position to the global amino acid composition heterogeneity. Positions are then
ranked by their Δχ² values, and the most or least biased sites up to a threshold are
removed.
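
The score described above can be sketched in Python as follows. This is an illustrative reading of the procedure, not the implementation of refs 78 and 79: it is assumed that the score is computed on residue counts, that the expected count of an amino acid in a sequence is the sequence length (excluding gaps) multiplied by the pooled frequency of that amino acid, and that the characters `-`, `?` and `X` are ignored. All function names are hypothetical.

```python
from collections import Counter

IGNORED = set("-?X")  # gaps and ambiguous residues (assumption)


def chi2_score(alignment):
    """Global chi-square compositional heterogeneity of a list of aligned sequences."""
    per_seq = [Counter(c for c in seq if c not in IGNORED) for seq in alignment]
    pooled = Counter()
    for counts in per_seq:
        pooled.update(counts)
    n_total = sum(pooled.values())
    if n_total == 0:
        return 0.0

    score = 0.0
    for counts in per_seq:
        n_seq = sum(counts.values())
        for residue, n_residue in pooled.items():
            expected = n_seq * n_residue / n_total
            if expected > 0:
                score += (counts.get(residue, 0) - expected) ** 2 / expected
    return score


def delta_chi2(alignment):
    """Per-position contribution: global score minus score with that column removed."""
    global_score = chi2_score(alignment)
    deltas = []
    for pos in range(len(alignment[0])):
        trimmed = [seq[:pos] + seq[pos + 1:] for seq in alignment]
        deltas.append(global_score - chi2_score(trimmed))
    return deltas
```

Positions with the largest Δχ² values contribute most to the compositional heterogeneity; sorting the output of `delta_chi2` gives the ranking from which sites are removed up to the chosen threshold.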

Tree topology tests. To compare how well different trees explained the aligned 
sequence data, approximately unbiased tests were performed on concatenated
as well as single-gene alignments. Two maximum-likelihood hypothesis trees 
were tested against the alignments. The first one, showing Lokiarchaeota group- 
ing with eukaryotes, was obtained from the concatenation of 36 markers, shown 
in Fig. 2b. The second was obtained from the concatenation of the 21 ribosomal 
proteins present in the previous set, and shows Korarchaeota grouping with 
eukaryotes. For individual gene trees, the taxa missing in the alignment were also 
pruned from the hypothesis trees using the utility nw_prune from the Newick 
Utilities package. For each alignment tested, per-site maximum likelihood was
calculated for both hypothesis trees with RAxML 8.0.9, using the option ‘-fG’, and
the PROTGAMMALG model. CONSEL 0.20 was then used to perform
approximately unbiased tests, using default settings. 

Identification of taxonomic markers. A reference set of 59 highly conserved, low-
or single-copy genes was used both as taxonomic markers in the binning process
and for concatenated phylogenies (Supplementary Table 2). Fifty-seven of these,
which were shown to be prone to very few or no horizontal gene transfers, were
taken from ref. 79. Two further arCOGs (arCOG04256 and arCOG04267,
subunits A″ and A′ of the DNA-directed RNA polymerase, respectively) were added
to the set (see Supplementary Information and Supplementary Table 2 for a list
of the arCOGs included in each phylogeny).

Unless otherwise stated, all trees included the same set of 101 reference
genomes: 58 archaeal genomes selected from the 120 analysed by Wolf et al.; 21
selected from the 45 newly sequenced organisms that were also used for
clustering, some of them already analysed in Guy et al.; two groups of three closely
related SAGs, pooled to provide more complete proteomes; and ten bacteria and
ten eukaryotes, as in Guy et al. (Supplementary Table 1). To remove paralogues
and obtain sets with at most one homologue per genome, members of each of the
selected arCOGs were aligned with MAFFT L-INS-i and a maximum-likelihood
phylogeny was inferred with RAxML, under a PROTCATLG model with 100
slow bootstraps. Previously removed paralogues were not included. Trees were
then visually inspected and paralogues removed using the same guidelines as in
ref. 79. This set, including at most one copy of each of the 59 reference arCOGs in
101 genomes, is hereafter referred to as ‘59ref’.

Binning. 

Training set. After arCOG attribution (see above), genes from LCGC14AMP 
belonging to the respective arCOGs were added to the 59ref set. Sequences were 
aligned and individual trees were built for each arCOG, as described above. Trees 
were then visually inspected and sequences from LCGC14AMP were classified in 
the following categories: Lokiarchaeum, Loki2/3 (distant Lokiarchaeum-related
clades), Thaumarchaeota, DPANN, Diapherotrites, Mimivirus, Bacteria or 
unknown. Classification was based on phylogenetic placement. In some cases 
where the phylogenetic placement was inconclusive, presence on the same contig 
of another gene already classified was used to aid classification. The fact that 
Lokiarchaeum is the only clade for which four to six distinct but closely related 
strains are present in LCGC14AMP greatly aided classification. In a minority of 
cases, some genes were classified in a category but marked as ‘putative’, as their 
attribution was slightly ambiguous. 

Quality control of the training set. Contigs containing markers classified in the 
first six categories mentioned above were extracted from the assembly, and their 
tetranucleotide frequencies (TNF) were calculated. To then assess the reliability of 
the classification, linear discriminant analysis (LDA) was performed in R with
the package MASS, using GC content and TNF as input data: half of the contigs
belonging to each of the six selected categories were randomly selected (excluding
the contigs marked as ‘putative’), and used to calculate LDA (function ‘lda’ in
MASS) (Supplementary Fig. 4). Based on this, classification was predicted using
the MASS function predict.lda. Incorrect predictions (that is, when the prediction 
based on LDA was not congruent with the classification based on the phylogen- 
etic trees) were recorded. The procedure was repeated 100 times, and contigs that 
were attributed to the wrong category 30 times or more were manually reviewed 
and eventually discarded from the training set (Supplementary Fig. 4a). Contigs 
marked as putative were attributed to the category if the prediction was congruent 
with the putative classification 90 times or more, or discarded otherwise. A 
further cycle of LDA calculation and prediction was performed, with no contigs 
classified as ‘putative’ this time (Supplementary Fig. 4b). To further investigate 
the robustness of the method, we randomized the categories of the input and 
performed the same LDA calculation and prediction as above, and assessed the 
number of incorrect predictions in each case (Supplementary Fig. 4c). This test 
confirmed that classifications based on trees were generally congruent with pre- 
dictions based on LDA, significantly more often than just by chance 
(Supplementary Fig. 4d-f). The final set of contigs was used as a training set 
for PhymmBL (see below), and comprised 839 kbp for Lokiarchaeum, 544 kbp
for Loki2/3, 521 kbp for Thaumarchaeota, 646 kbp for DPANN, 43 kbp for 
Diapherotrites and 21 kbp for Mimivirus. 
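
The two input features of the discriminant analysis, GC content and tetranucleotide frequencies, can be computed as in the Python sketch below. The analysis itself was run in R with MASS; this sketch covers only the feature calculation, and its details are assumptions: windows containing characters other than A, C, G or T are skipped, and the two strands are not merged (reverse-complement tetramers are counted separately).

```python
from itertools import product

# All 256 tetranucleotides in a fixed order, so that feature vectors are comparable.
TETRAMERS = ["".join(p) for p in product("ACGT", repeat=4)]


def gc_content(seq):
    """Fraction of G and C among unambiguous bases (0.0 if there are none)."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    return (seq.count("G") + seq.count("C")) / acgt if acgt else 0.0


def tetranucleotide_frequencies(seq):
    """Relative frequency of each tetranucleotide, in the order of TETRAMERS."""
    seq = seq.upper()
    counts = dict.fromkeys(TETRAMERS, 0)
    for i in range(len(seq) - 3):
        kmer = seq[i:i + 4]
        if kmer in counts:  # skips windows with N or other ambiguity codes
            counts[kmer] += 1
    total = sum(counts.values())
    return [counts[k] / total if total else 0.0 for k in TETRAMERS]
```

A contig is then represented by one GC value plus a 256-element frequency vector, which is the kind of input a classifier such as LDA expects.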

Binning using PhymmBL. PhymmBL version 4.0 was run separately for binning
the contigs larger than 1 kbp from both LCGC14AMP and LCGC14. As training
sets, all prokaryotic genomes published in GenBank (retrieved on 2014-03-04,
2,716 genomes) were complemented with the 60 newly sequenced genomes used
to constitute the arCOG set that were absent from GenBank (Supplementary Table
1), and the six training sets (Lokiarchaeum, Loki2/3, Thaumarchaeota, DPANN,
Diapherotrites and Mimivirus) obtained from LCGC14AMP as described above.

Reassembly of Lokiarchaeum bin. In the LCGC14 assembly, 3,165 contigs (18.6
Mbp in total) were predicted to belong to the Lokiarchaeum genus, indicating a 
large degree of microdiversity. In order to reduce redundancy, contig sets were 
constituted, with increasing low-coverage cut-offs (from 1 to 100, with a 1×
increment). The completeness and redundancy of each set was then estimated 
using the micomplete script (manuscript in preparation). In brief, micomplete 
bases its predictions on the presence or absence of a set of single-copy pan- 
orthologs, in this case 162 markers defined in ref. 86. To avoid overemphasizing 
the presence of markers that are often very close to each other (for example, 
ribosomal proteins), each marker receives a weight coefficient based on the dis- 
tance between this marker and its closest neighbours both upstream and down- 
stream, averaged over a representative set of 70 Archaea (set described in ref. 79). 
Completeness is the fraction of weighted markers present, and is thus constrained 
between 0 (no marker present) and 1 (all markers present). Redundancy is cal- 
culated as the total number of copies of weighted markers present divided by the 
number of weighted markers present, and is thus always at least one, where
one would mean that all markers present are single copy. These two numbers
were calculated for each contig set, and a cut-off of 24× represented the best
compromise between completeness (0.89) and redundancy (1.67) (Loki24× set,
Supplementary Fig. 5a). 
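
The two quantities defined above can be written out in a short Python sketch. It is not the micomplete script; the function name, the marker names in the usage example and the weights are hypothetical, and the calculation of the weights from inter-marker distances is not reproduced here.

```python
def completeness_redundancy(copy_counts, weights):
    """Weighted completeness and redundancy of a contig set.

    copy_counts -- dict: marker name -> number of copies found in the contig set
    weights     -- dict: marker name -> weight coefficient, for all markers of the set
    """
    total_weight = sum(weights.values())
    # Sum of weights of the markers found at least once.
    present_weight = sum(w for m, w in weights.items() if copy_counts.get(m, 0) > 0)
    # Sum of weights multiplied by copy number.
    copies_weight = sum(w * copy_counts.get(m, 0) for m, w in weights.items())

    completeness = present_weight / total_weight
    redundancy = copies_weight / present_weight if present_weight else float("nan")
    return completeness, redundancy


# Hypothetical example: three markers, one missing and one present in two copies.
example = completeness_redundancy({"m1": 1, "m2": 2}, {"m1": 1.0, "m2": 1.0, "m3": 2.0})
```

Evaluating this function for contig sets built with each coverage cut-off gives the completeness and redundancy curves from which a compromise cut-off can be chosen.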

To obtain a better assembly with longer contigs with only reads from 
Lokiarchaeum, reads belonging to Lokiarchaeum contigs were reassembled as 
follows. Reads from the LCGC14 data set, corrected by SPAdes, were mapped 
against the whole LCGC14 assembly with bwa-mem, and reads that matched
contigs in the Loki24× set were extracted. For paired-end reads, both reads were
retained if at least one read matched the Loki24× set. These extracted reads were
assembled with SPAdes as above, but without the single-cell mode and without
read correction. Again, completeness and redundancy were assessed for sets of
contigs with increasing low-coverage cut-offs, and a threshold of 20× coverage was
found to give the best compromise between completeness (0.92) and redundancy 
(1.44) (Supplementary Fig. 5b). The selected 504 contigs, hereafter referred to as 
‘Lokiarchaeum’, represented 5.14 Mbp of sequence. The N50 and N90 of this
assembly were 15.4 and 5 kbp, respectively. 
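
For readers unfamiliar with these statistics, the sketch below computes N50 and N90 from a list of contig lengths, assuming the usual definition: the length of the shortest contig in the smallest set of longest contigs that together cover at least 50% (or 90%) of the assembly. The function name `nx` is hypothetical.

```python
def nx(lengths, x):
    """Return the Nx statistic (for example x=50 for N50) of a list of contig lengths."""
    if not lengths:
        raise ValueError("at least one contig length is required")
    ordered = sorted(lengths, reverse=True)
    threshold = sum(ordered) * x / 100
    running = 0
    for length in ordered:
        running += length
        if running >= threshold:
            return length
    return ordered[-1]
```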

Annotation and contamination assessment of Lokiarchaeum genome bin. 
Annotation of all predicted open reading frames of the Lokiarchaeum genome 
bin was done using prokka, using a concatenation of the three kingdom-specific
protein databases shipped with prokka as the main database, predicting tRNA
and rRNA as above. Furthermore, proteins were compared to sequences in
NCBI’s non-redundant database and RefSeq using BLAST and results were
inspected using MEGAN. Additionally, an InterProScan 5 analysis (which integrates
a collection of protein signature databases such as BlastProDom, FPrintScan,
HMMPIR, HMMPfam, HMMSmart, HMMTigr, ProfileScan, HAMAP,
PatternScan, SuperFamily, SignalPHMM, TMHMM, HMMPanther, Gene3D,
Phobius and Coils) was performed and the genome was viewed and analysed
in MAGE. Selected genes of interest for the evolution of the eukaryotic cell and/or
subjected to phylogenetic analyses were checked manually and annotated
according to their protein domains/signatures based on PSI-BLAST results,
arCOG attributions (Supplementary Tables 6-10) as well as protein structure
predictions using Phyre2. To check for the presence of particular genes of
interest, such as specific eukaryotic ribosomal proteins, or eukaryotic ribosomal
protein L41e which has been detected in several Euryarchaeota, existing
alignments from arCOGs and/or KOGs were downloaded from eggNOG and used in
PSI-BLAST searches as query against the Lokiarchaeal composite genome. 

Several controls were performed to confirm the absence of obvious contami- 
nants in the final Lokiarchaeum bin. Most importantly, all contigs containing 
ESPs discussed in the manuscript were manually inspected to verify that these 
actually belong to Lokiarchaeum, by: (1) inspecting neighbouring genes for the 
presence of archaeal markers; (2) by querying all proteins present on contigs 
containing ESPs against the LCGC17 metagenome to check whether highly
similar homologues could be found several times in the sample (generally
3–7 copies), accounting for the different, highly related Lokiarchaeota strains; and
(3) by querying the same proteins against environmental metagenomes publicly
available at NCBI, controlling that most of them had highly similar homologues
in an ocean sediment metagenome, but not in any other metagenome. This last
check was based on our finding that all ESPs of Lokiarchaeum had highly similar 
homologues in this marine sediment metagenome (for example, up to 98% for 
Lokiactins) indicating that closely related genomes of members of Lokiarchaeota 
are present, which is in accordance with the finding that DSAG represents an 
abundant group in these sub-seafloor sediments.

Finally, proteins comprising informational processing machineries were also
investigated using MEGAN. The absence of bacterial informational processing
proteins indicated that there is no bacterial contamination in the final bin (see
Supplementary Discussion 3).

Identification of taxonomic markers in the bins. For Lokiarchaeum, arCOG
attribution was performed as described above, and taxonomic markers were iden- 
tified by their arCOG attribution. Whenever there were two copies of the same 
marker, the copy located on the contig with the highest coverage was selected. 

For Loki2/3, the category had two copies of 19 out of 36 markers present, with 
divergent phylogenetic placement. A clear GC content difference could also be 
observed between the copies, and, with a single exception, the two sets of copies 
were not overlapping (Supplementary Fig. 6). The exception was discarded and 
the remaining two-copy markers were divided into two bins, Loki2 (high GC,
ranging from 32.2 to 37.3%) and Loki3 (low GC, ranging from 27.7 to 30.7%).
Single-copy markers with a GC content falling into the range of either Loki2 or
Loki3 were attributed to the corresponding bin; the other copies were discarded.
Loki2 (high GC, average 32.8%) consisted of 21 markers and Loki3 (low GC, 
average 29.9%) of 34 markers. 
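
The assignment of single-copy markers by GC content amounts to a simple range test, sketched below in Python. The ranges are those stated above; the function name and the use of inclusive boundaries are assumptions for illustration.

```python
# GC ranges (in percent) of the two-copy markers, as stated in the text.
LOKI2_RANGE = (32.2, 37.3)  # high GC
LOKI3_RANGE = (27.7, 30.7)  # low GC


def assign_bin(gc_percent):
    """Return 'Loki2', 'Loki3' or None (marker discarded) for a GC percentage."""
    if LOKI2_RANGE[0] <= gc_percent <= LOKI2_RANGE[1]:
        return "Loki2"
    if LOKI3_RANGE[0] <= gc_percent <= LOKI3_RANGE[1]:
        return "Loki3"
    return None
```

Because the two ranges do not overlap, a marker can never be assigned to both bins, and markers falling in the gap between them are left unassigned.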

Taxonomic affiliation of the Lokiarchaeum proteome. To estimate how 
Lokiarchaeum relates to its closest relatives, its proteome was aligned to 
NCBI’s non-redundant database using blastp, with an E threshold of 0.001. To 
provide a way to compare results, the complete proteomes of ‘Candidatus 
Korarchaeum cryptofilum’ OPF8, ‘Candidatus Caldiarchaeum subterraneum’ 
and the incomplete proteome of SCGC AB-539-E09, sole representative of the 
Miscellaneous Crenarchaeotal group (MCG) were similarly analysed. The results 
of the blasts were filtered to remove self-hits and hits to organisms belonging to 
the same phylum. In the case of the MCG representative, only self-hits were 
removed. Filtered results were then analysed with MEGAN 5.4.0. Last common 
ancestor parameters were set as follows: Min Score, 50; Max Expected, 0.01; Top 
Percent, 5; Min Support, 1; Min Complexity, 0.0. For each result, branches were 
uncollapsed at the level below super-kingdom. Profiles were compared using 
Absolute counts, and the results were exported and further analysed in R. 
Categories to which fewer than 100 hits were attributed in Lokiarchaeum were
grouped under the ‘Other Archaea’ or ‘Other Bacteria’ categories. Hits to ‘root’,
viruses, unclassified sequences and hits not assigned were grouped under the
‘Other’ category. Results are shown in Fig. 2b. Using the same parameters,
functional COG categories were assigned to the Lokiarchaeal proteome to gain
insight into the functional and taxonomic affiliation of the Lokiarchaeal proteome
(Supplementary Fig. 15).

Phylogenetic analyses of selected eukaryotic signature proteins (ESPs). 
Selection of ESCRT-III homologues. For the ESCRT-III phylogeny, eukaryotic 
ESCRT-III homologues as described in Makarova et al. (comprising the families
Vps60/Vps20/Vps32 and Vps46/2/24), as well as archaeal ESCRT-III
homologues belonging to arCOG00452, arCOG00453 and arCOG00454 families present
in Crenarchaeota, Thaumarchaeota and Aigarchaeota were extracted from
GenBank. The more distantly related SNF7-like arCOG families (arCOG09747,
arCOG09749 and arCOG07402) present in a few euryarchaeal species were not
included in the alignment. Subsequently, respective arCOGs were retrieved from 
both the LCGC14AMP metagenome and Lokiarchaeum final bin (see section on 
arCOG attribution). The ESCRT operon present on a Loki2/3 contig revealed the 
presence of an additional ESCRT-III homologue (most similar to eukaryotic 
Vps20/32/60 sequences), which was not attributed to an archaeal COG. This 
homologue was used as an additional query to retrieve highly similar sequences 
from the LCGC14AMP metagenome as well as the Lokiarchaeum final bin using 
blastp. Finally, each of the two different SNF7-family proteins, which are part of 
the ESCRT operons of Lokiarchaeum and Loki2/3, respectively, were used as 
queries to search published metagenomes (NCBI) with blastp. Highly similar 
sequences (coverage > 70%; identity > 40%) were retrieved and included in the 
phylogeny as well. 

Selection of Vps4 homologues. Archaeal sequences assigned to arCOG01307 (cell 
division ATPase of the AAA+ class, ESCRT system component) as well as 
eukaryotic Vps4 homologues, including a few proteins of the cdc48 subfamily, 
were retrieved from GenBank. The latter protein family served as outgroup, as 
described in Makarova et al. Sequences assigned to arCOG01307 were also
extracted from the LCGC14AMP metagenome as well as from the Lokiarchaeum
bin, and sequences highly similar to the Vps4 of Lokiarchaeum were
retrieved from published metagenomes (coverage > 60%; identity > 50%). The
LCGC14AMP metagenome contained a large number of sequences assigned to
arCOG01307, including hits to Vps4 homologues of Thaumarchaeota. However, 
ATPases that, based on phylogenetic analyses, turned out to be unrelated to Vps4 
were removed from the analysis. Based on the initial phylogeny that included all 
of these sequences, only those LCGC14AMP Vps4 homologues that clustered 
with the Vps4 homologue of the Lokiarchaeum bin were selected to avoid the 
inclusion of false positives. 

Selection of EAP30-domain (Vps22/36-like) and Vps25 homologues. EAP30 and 
Vps25 homologues have so far not been detected in Archaea and thus the respect- 
ive sequences present in Lokiarchaeum (Extended Data Table 1 and 
Supplementary Table 6) have not been assigned to an arCOG family. Thus, only 
Lokiarchaeum homologues, as well as selected representative eukaryotic 
sequences spanning the eukaryotic diversity that were retrieved from the 
GenBank database were included in these phylogenetic reconstructions. 
Putative EAP30- and Vps25-like homologues were discovered in the 
Lokiarchaeum genome since they are part of the ESCRT operon present on 
contig119. These sequences were used as queries to also retrieve homologues 
from the LCGC14AMP metagenome (E cut-off, 0.1; q coverage, 85) as well as 
from metagenomes deposited at NCBI. 

Selection of small GTPase family homologues (IPR006689 and IPR001806). The 
investigation of the Lokiarchaeum proteome revealed large numbers of proteins 
homologous to small GTPases of the Ras and Arf families. In order to reliably 
identify all putative small GTPases in the Lokiarchaeum bin, an InterPro scan
was performed and all proteins assigned to IPR006689 (Ras type of small
GTPases) and IPR001806 (Arf/Sar type of small GTPases) were extracted.
Subsequently, archaeal reference sequences belonging to these IPR families were
retrieved from GenBank. Eukaryotic and bacterial reference sequences were
selected based on a previous study by Dong et al. that investigated the
phylogenetic relationships of members of the Ras superfamily. Due to the large number
of GTPase homologues in the Lokiarchaeum bin, and the difficulty assigning 
these proteins to a particular taxon, it was decided not to analyse all GTPase 
homologues present in metagenomes. Upon inspection of the MAFFT L-INS-i 
alignment, partial sequences and extremely divergent homologues were removed. 

Selection of actin homologues. So far, the only actin-related proteins detected in a
few members of the archaea belong to arCOG05583 and have been referred to as
crenactins. Three proteins encoded by the Lokiarchaeum genome were
assigned to this arCOG, and a blastp search against RefSeq revealed that these 
proteins are more closely related to bona fide actins of Eukaryotes than to archaeal 
crenactins. In order to identify additional full-length actin homologues, blastp 
(E-value cut-off<10 '°) searches were performed against the Lokiarchaeum 
genome as well as the LCGC14AMP metagenome, using this Lokiarchaeum actin 
homologue as query. Finally, a total of five and 42 full-length (>180 amino acids) 
actin-related proteins were retrieved from the Lokiarchaeum bin and from the 
LCGC14AMP metagenome, respectively. These sequences were merged with the 
archaeal protein sequences belonging to arCOG05583 as well as with major 
eukaryotic actin families (actins and ARP1-3 (refs 101, 102)). We also assessed 
the phylogenetic position of the bacterial actin-related protein (BARP) of the 
bacterium Haliangium ochraceum’” in light of the new Lokiarchaeal actin homo- 
logues, and concluded that the Haliangium BARP was most likely acquired via 
horizontal gene transfer from eukaryotes. 
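
The selection step described above (an E-value cut-off followed by a length filter for full-length homologues) can be sketched as follows. This is only an illustrative sketch, not the authors' pipeline: the `Hit` record and the function `select_full_length_hits` are hypothetical names, and the E-value threshold is passed in as a parameter rather than hard-coded. The only value taken from the text is the length criterion of more than 180 amino acids.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """Hypothetical record for one blastp hit (not a real BLAST API type)."""
    seq_id: str
    evalue: float
    sequence: str


def select_full_length_hits(hits, max_evalue, min_length=180):
    """Keep hits with E-value < max_evalue and length > min_length.

    If a subject sequence is hit more than once, only its best
    (lowest E-value) hit is kept. Results are sorted by E-value.
    """
    best = {}
    for hit in hits:
        if hit.evalue >= max_evalue:
            continue  # fails the E-value cut-off
        if len(hit.sequence) <= min_length:
            continue  # not full-length (text: >180 amino acids)
        previous = best.get(hit.seq_id)
        if previous is None or hit.evalue < previous.evalue:
            best[hit.seq_id] = hit
    return sorted(best.values(), key=lambda h: h.evalue)
```

Sequences retained by such a filter would then be merged with the reference sets before alignment, as the paragraph describes.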

Phylogenetic reconstructions. For all of these ESPs, the selected sequences were 
aligned using MAFFT L-INS-i*' and trimmed with TrimAl]* to retain only those 
columns present in at least 50% (for ESCRT-III; Vps4; actin homologues), 40% 
(EAP30-domain and Vps25 homologues) and 80% (small GTPases) of the 
sequences. Alignments were visually inspected and manually edited whenever 
necessary and subsequently subjected to maximum-likelihood phylogenetic ana- 
lyses using RAxML (8.0.22, PROTGAMMALG) with the slow bootstrap option 
(100 bootstraps). 
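
To make the trimming criterion concrete, the sketch below keeps only those alignment columns in which at least a given fraction of sequences carry a residue (0.5, 0.4 or 0.8 for the data sets listed above). It is a minimal illustration of the occupancy rule, not the trimming tool used in the study; the function name `trim_alignment` and the dictionary-based alignment representation are assumptions made for the example.

```python
def trim_alignment(alignment, min_occupancy, gap_chars="-."):
    """Drop columns where fewer than min_occupancy of sequences have a residue.

    alignment: dict mapping sequence name -> aligned sequence (equal lengths).
    min_occupancy: fraction between 0 and 1 (e.g. 0.5 for 50%).
    Returns a new dict with the retained columns only.
    """
    if not alignment:
        return {}
    lengths = {len(seq) for seq in alignment.values()}
    if len(lengths) != 1:
        raise ValueError("all aligned sequences must have the same length")
    n_seqs = len(alignment)
    n_cols = lengths.pop()

    keep = []
    for col in range(n_cols):
        # Count sequences that have a residue (not a gap) in this column.
        present = sum(1 for seq in alignment.values() if seq[col] not in gap_chars)
        if present / n_seqs >= min_occupancy:
            keep.append(col)

    return {name: "".join(seq[c] for c in keep) for name, seq in alignment.items()}


if __name__ == "__main__":
    toy = {"s1": "MK-LV", "s2": "M--LV", "s3": "MKALV", "s4": "M--L-"}
    print(trim_alignment(toy, 0.5))
```

A stricter threshold removes more columns: in the toy alignment, a threshold of 0.8 retains only the two columns that are occupied in every sequence.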

Contig maps. The contig maps displayed in Fig. 4a were drawn with the software 
genoPlotR v.0.8.2 (ref. 104). 
Extended Data Table 1 | Overview of Lokiarchaeal ESPs 

| Suggested function | Product | Locus tag | IPR-domains | Comment |
|---|---|---|---|---|
| Putative ESCRT-III proteins | Vps2/24/46-like protein* | Lokiarch_37480 | IPR005024 Snf7 | |
| Putative ESCRT-III proteins | Vps20/32/60-like protein* | Lokiarch_16760 | IPR005024 Snf7 | More distant homologs also present in several other members of the TACK superphylum. |
| Putative ESCRT-II proteins | EAP30 domain protein (Vps22/36-like)* | Lokiarch_37450 | IPR007286 EAP30 | |
| Putative ESCRT-II proteins | Vps25-like protein* | Lokiarch_37460 | IPR014041 ESCRT-II complex, Vps25 subunit, N-terminal winged helix; IPR008570 ESCRT-II complex, Vps25 subunit; IPR011991 Winged helix-turn-helix DNA-binding domain | Previously not found in Archaea. |
| Putative ESCRT-I protein | Hypothetical protein with Vps28-like domain† | Lokiarch_10170 | IPR007143 Vacuolar protein sorting-associated, Vps28 | Vps28 is part of ESCRT-I, potential interacting protein Lokiarch_16740 (see Table S6). |
| Putative ESCRT-associated protein | Vps4 ATPase* | Lokiarch_37470 | IPR003959 ATPase, AAA-type, core; IPR027417 P-loop containing nucleoside triphosphate hydrolase; IPR003593 AAA+ ATPase domain; IPR007330 MIT-domain | Also present in other members of the Archaea. |
| Putative vesicular trafficking machinery associated proteins | Hypothetical proteins vacuolar fusion domain MON1* | Lokiarch_21780, Lokiarch_01670, Lokiarch_15160 | IPR004353 Vacuolar fusion protein MON1 | |
| Putative vesicular trafficking machinery associated proteins | Hypothetical proteins with longin-like domains | Lokiarch_01890, Lokiarch_13110, Lokiarch_03280, Lokiarch_22790, Lokiarch_04850 | IPR011012 Longin-like domain; IPR010908 Longin domain | Previously not found in other prokaryotic organisms (see Table S6 and Table S10 for more details). |
| Putative vesicular trafficking machinery associated proteins | BAR/IMD domain-like superfamily protein* | Lokiarch_46220, Lokiarch_08900 | IPR004148 BAR domain; IPR009602 FAM92 protein | Includes various protein families that bind membranes and detect membrane curvature. |
| Cell division/cytoskeleton related proteins | Actin and related proteins* | Lokiarch_44920, Lokiarch_36250, Lokiarch_10650, Lokiarch_09100, Lokiarch_41030 | IPR004000 Actin-related protein; IPR020902 Actin/actin-like conserved site | Some Cren-, Kor- and Aigarchaeota encode crenactins (arCOG05583). |
| Cell division/cytoskeleton related proteins | Hypothetical proteins with gelsolin-like domains* | 12 proteins, Suppl Table S6 | IPR007122 Villin/Gelsolin; IPR029006 ADF-H/Gelsolin-like domain; IPR007123 Gelsolin-like domain | Previously not found in Archaea. Serve as candidates for potential actin-binding proteins. |
| Cell division/cytoskeleton related proteins | Small GTP-binding domain proteins with Ran-/Ras-/Rab-/Rho- and Arf-domain signatures*§ | 92 proteins, see Suppl Table S6 | IPR001806 Small GTPase superfamily; IPR003579 Small GTPase superfamily, Rab type; IPR027417 P-loop containing nucleoside triphosphate hydrolase; IPR020849 Small GTPase superfamily, Ras type; IPR002041 Ran GTPase; IPR003578 Small GTPase superfamily, Rho type; IPR005225 Small GTP-binding protein domain; IPR024156 Small GTPase superfamily, ARF type | Extreme proliferation of small GTP-binding proteins in Lokiarchaeum (92 proteins in composite genome, see Fig. 3b and c); in addition Lokiarchaeum encodes 12 Roadblock/LC7 domain proteins, which might serve as GTPase activating enzyme (see Suppl Table S6). |
| Ubiquitin modifier system related proteins | Ubiquitin-like proteins* | Lokiarch_29280, Lokiarch_29310, Lokiarch_37670 | IPR029071 Ubiquitin-related domain; IPR000626 Ubiquitin-like | |
| Ubiquitin modifier system related proteins | Putative E1-like ubiquitin activating protein | Lokiarch_15900, Lokiarch_29320 | IPR023280 Ubiquitin-like 1 activating enzyme, catalytic cysteine domain; IPR019572 Ubiquitin-activating enzyme (see SOM for more details) | |
| Ubiquitin modifier system related proteins | Putative E2-like ubiquitin conjugating protein | Lokiarch_10330, Lokiarch_41760, Lokiarch_29330 | IPR016135 Ubiquitin-conjugating enzyme/RWD-like; IPR000608 Ubiquitin-conjugating enzyme, E2 | Ubiquitin modifier system was previously identified in Aigarchaeota: canonical E3 ubiquitin ligases are not present in Caldiarchaeum subterraneum and Lokiarchaeum. However, both archaeal genomes contain RING-… could serve as … S6). |
| Ubiquitin modifier system related proteins | Hypothetical proteins with JAB1/MPN/MOV34 metalloenzyme domain | Lokiarch_29340, Lokiarch_43590, Lokiarch_26830, Lokiarch_08140 | IPR000555 JAB1/MPN/MOV34 metalloenzyme domain | |
| Eukaryotic ribosomal protein | Putative homolog of eukaryotic ribosomal protein L22e† | Lokiarch_30160 | | Previously not found in Archaea. Best blast hit: GenBank EPR78232.1, 60S ribosomal protein L22 [Spraguea lophii 42_110], Expect = 0.21 |
| Oligosaccharyl transferase complex proteins | Ribophorin I superfamily protein | Lokiarch_43710 | IPR007676 Ribophorin I | Previously not found in Archaea. |
| Oligosaccharyl transferase complex proteins | Putative oligosaccharyl transferase complex, subunit OST3/OST6 | Lokiarch_24040, Lokiarch_25040 | IPR021149 Oligosaccharyl transferase complex, subunit OST3/OST6 | Previously not found in Archaea. |
| Oligosaccharyl transferase complex proteins | Putative oligosaccharyl transferase STT3 subunit | Lokiarch_28460 | IPR003674 Oligosaccharyl transferase, STT3 subunit | Homologs also present in some other Archaea. |


Locus tags that are highlighted in bold indicate a significant top blast hit of the respective protein of Lokiarchaeum to a eukaryotic sequence (see Supplementary Table 6 for further details). 
* Phylogenetic analyses have been performed. 
† Alignments shown in Supplementary figures. 
‡ Protein domain assignments for these proteins listed in Supplementary Table 10. 
§ While most small GTPases encoded by Lokiarchaeum have highest similarity to eukaryotic homologues, approximately 10% are most similar to Archaea and/or Bacteria (see Supplementary Table 6 for more details). 
Neurons for hunger and thirst transmit a 
negative-valence teaching signal 


J. Nicholas Betley1*, Shengjin Xu1*, Zhen Fang Huang Cao1*, Rong Gong1, Christopher J. Magnus1, Yang Yu1 & Scott M. Sternson1 


Homeostasis is a biological principle for regulation of essential physiological parameters within a set range. Behavioural 
responses due to deviation from homeostasis are critical for survival, but motivational processes engaged by 
physiological need states are incompletely understood. We examined motivational characteristics of two separate 
neuron populations that regulate energy and fluid homeostasis by using cell-type-specific activity manipulations in 
mice. We found that starvation-sensitive AGRP neurons exhibit properties consistent with a negative-valence teaching 
signal. Mice avoided activation of AGRP neurons, indicating that AGRP neuron activity has negative valence. AGRP 
neuron inhibition conditioned preference for flavours and places. Correspondingly, deep-brain calcium imaging 
revealed that AGRP neuron activity rapidly reduced in response to food-related cues. Complementary experiments 
activating thirst-promoting neurons also conditioned avoidance. Therefore, these need-sensing neurons condition 
preference for environmental cues associated with nutrient or water ingestion, which is learned through reduction of 
negative-valence signals during restoration of homeostasis. 


AGRP neurons are a hypothalamic population that is activated or 
inhibited by hormonal signals of energy deficit or surfeit, respect- 
ively’. AGRP neuron ablation or inhibition suppresses feeding”’, 
and activation elicits food consumption and instrumental food-seek- 
ing within minutes***, indicating that these neurons are an entry 
point to motivational processes resulting from a homeostatic deficit®. 
Because food preferences and food-seeking behaviours are learned, in 
part, as a consequence of nutrient intake’, we investigated the cap- 
ability of AGRP neurons to directly influence learning in mice. 

Multiple learning processes contribute to feeding behaviour**. 
Behavioural responses to Pavlovian conditioning are typified by 
approach or avoidance to cues that have been associated with a rein- 
forcer*"®, such as learning preference for a nutritive food over a non- 
nutritive object. Instrumental conditioning is a process by which an 
animal learns to perform an action that elicits a valued outcome, such 
as lever pressing for food. Neurons that increase food-seeking and 
consumption in homeostatic hunger may influence these learning 
processes in two distinct ways. Approach to cues and performance 
of actions associated with food ingestion can be strengthened through 
the intrinsic positive valence of nutritive food’, which is potentiated 
during energy deficit*’*"* (Extended Data Fig. 1a). Alternatively, 
preference or performance of actions can be conditioned by reducing 
states with negative valence'’'*"'* (Extended Data Fig. 1b). For neu- 
rons that elevate food consumption, their valence can be distin- 
guished, in the absence of food, by whether an animal learns to 
prefer cues that are associated with increased or decreased activity 
of these neurons, respectively. Influential experiments with brain 
stimulation in the lateral hypothalamus found neurons that elicit food 
intake, have positive valence, and facilitate both Pavlovian and instru- 
mental learning'”’*. Although early behavioural theories assumed 
that homeostatic deficits controlled need-based behaviours by redu- 
cing a negative internal state’, neurons that increase food intake have 
not been reported to signal negative valence*"’. 

Nevertheless, human subjects report that energy deficit is unpleasant 
and eating can alleviate this feeling’”-*’, which indicates a possible 
role for a poorly understood negative-valence state. Neuronal systems 
that elicit food-seeking and also mediate the negative valence assoc- 
iated with a homeostatic hunger state have not been identified. Here, 
we used cell-type-specific neuronal activity manipulations and deep- 
brain in vivo imaging to determine that hunger-promoting AGRP 
neurons can influence learning and behaviour through negative-val- 
ence states. 


AGRP neurons condition flavour preference 


To investigate whether elevated AGRP neuron activity transmits a 
negative-valence signal, we performed flavour preference condition- 
ing using ad libitum fed mice expressing channelrhodopsin-2 (ChR2) 
in AGRP neurons (AGRP^ChR2) (Fig. 1a–d). AGRP^ChR2 mice and a 
control AGRP^eGFP group were habituated to consume two differently 
flavoured non-nutritive gels and were then conditioned by separately 
consuming one flavour during photostimulation and the other without 
AGRP neuron activation (Fig. 1e). After conditioning, the preference 
for the flavour consumed by AGRP^ChR2 mice during AGRP 
neuron photostimulation was reduced (Fig. 1f). To check whether 
AGRP neuron photostimulation elicits a general aversive state ana- 
logous to the nausea-inducing agent LiCl, we performed a condi- 
tioned taste aversion test by pairing a novel taste (saccharin 
solution) with subsequent photoactivation of AGRP neurons; there 
was no resulting aversion to saccharin (Extended Data Fig. 2a-c). 
Together, these experiments demonstrate that a flavour cue associated 
with high levels of AGRP neuron activity became less preferred, indi- 
cating that these neurons transmit a negative-valence signal, but 
AGRP neurons do not appear to elicit strong aversion or disgust, 
consistent with the observation that AGRP neuron activation leads 
to copious food consumption”. 

AGRP neuron activity is normally elevated during energy deficit”. 
If AGRP neurons contribute to feeding behaviours through a negative- 
valence signal, then inhibition of AGRP neurons during food restriction 
would be expected to facilitate learning (Extended Data Fig. 1b). For cell- 
type-specific chemogenetic inhibition, Agrp-IRES-Cre mice were virally 


1Janelia Research Campus, Howard Hughes Medical Institute, 19700 Helix Drive, Ashburn, Virginia 20147, USA. 
transduced to express a pharmacologically selective ligand-gated chloride 
channel, PSAM^L141F-GlyR (AGRP^PSAM-GlyR mice), and enhanced 
green fluorescent protein (eGFP) (Fig. 1g–i and Extended Data Fig. 2d). 
As previously characterized”, intraperitoneal (i.p.) administration of the 
channel’s cognate selective synthetic ligand (PSEM^89S) inhibited AGRP 
neuron activity (Extended Data Fig. 2e–g), and reduced food consumption 
(Fig. 1j and Extended Data Fig. 2h) to an extent that correlated with 
the transgene transduction efficiency (Extended Data Fig. 2i–n). We 
measured flavour preference associated with AGRP neuron silencing 
in food-restricted AGRP^PSAM-GlyR mice (85–90% body weight). 

Figure 1 | AGRP neurons condition flavour preference. a, Optical fibre 
position over the arcuate nucleus (ARC). b, ChR2-eYFP in AGRP neurons 
(box). Scale bar, 1 mm. c, d, Fos immunofluorescence following 
photostimulation in AGRP^eGFP (c) or AGRP^ChR2 (d) mice. Scale bar, 100 µm. 
e, Experimental design of conditioned flavour preference assay in ad libitum fed 
AGRP^ChR2 or AGRP^eGFP mice. f, Change in preference for flavour paired with 
light in AGRP^eGFP and AGRP^ChR2 mice (eGFP, n = 6; ChR2, n = 8). 
g, h, Injection of rAAV (g) for Cre-dependent expression of PSAM^L141F-GlyR- 
IRES-eGFP in (h) AGRP neurons. i, Image of virally transduced AGRP neurons 
showing PSAM^L141F-GlyR-IRES-eGFP expression. Scale bar, 1 mm. j, Chow 
food intake reduction for food-restricted mice treated with PSEM^89S grouped 
by transgene transduction efficiency (low, <50%, n = 13; high, >50%, n = 16). 
k, Change in preference for flavour paired with PSEM^89S injection in food- 
restricted AGRP^PSAM-GlyR mice. l, Change in preference correlates with 
reduction of chow food intake (n = 29 mice). m, Flavour preference pre- and 
post-conditioning for mice grouped by post hoc food intake reduction test. 
*P < 0.05, **P < 0.01, ***P < 0.001. Values are means ± s.e.m. Statistical 
analysis in Extended Data Table 1. 
Figure 2 | AGRP neurons condition place preference. a, Experimental design 
of place preference conditioning with chemogenetic silencing. Red bar, 
chemogenetic silencing side. b, For an AGRP^PSAM-GlyR mouse that was food- 
restricted during conditioning, scatter plot of position and a heat map showing 
per cent occupancy time. c, Change in occupancy time for food-restricted (FR) 
AGRP^eGFP mice (n = 13), food-restricted AGRP^PSAM-GlyR mice (n = 20) or 
ad libitum (AL) AGRP^PSAM-GlyR mice (n = 9) with >50% PSAM^L141F-GlyR 
transduction efficiency after conditioning with PSEM^89S injections. d, e, 
Change in occupancy time for side paired with PSEM^89S is correlated with 
PSAM^L141F-GlyR transduction efficiency (n = 26 mice) (d) and chow food intake 
reduction for mice treated with PSEM^89S (n = 35 mice) (e). f, Preference shift is 
correlated for place and flavour conditioning. g, Experimental design for place 
conditioning during optogenetic activation. Blue bar, photostimulated side. 
h, For an AGRP^ChR2 mouse, scatter plots of position and heat maps showing per 
cent occupancy time. i, j, Per cent occupancy time on photostimulated side for 
AGRP^eGFP (open circles, n = 12) or AGRP^ChR2 (filled circles, n = 12) mice during 
15 min conditioning sessions (two-way repeated measures ANOVA, group: 
F(1,154) = 3.0, P = 0.097; session: F(7,154) = 2.3, P = 0.029; interaction: F(7,154) = 
3.3, P = 0.003) (i) and second half of each 15-min session (group: F(1,154) = 6.4, 
P = 0.019; session: F(7,154) = 3.2, P = 0.004; interaction: F(7,154) = 3.8, P < 0.001) 
(j). k, Change in occupancy time on the previously photostimulated side for 
AGRP^eGFP (open circles, n = 12) or AGRP^ChR2 (filled circles, n = 12) mice during 
1,800 s extinction session. *P < 0.05, **P < 0.01, ***P < 0.001. Values are 
means ± s.e.m. Statistical analysis in Extended Data Table 1. 
Preference increased for the flavour consumed during AGRP neuron 
inhibition (Fig. 1k), and the change in flavour preference correlated with 
reduction of chow re-feeding by AGRP neuron silencing (Fig. 1l, m). 
Therefore inhibiting AGRP neuron activity in energy deficit conditions 
flavour preference, consistent with suppression of a negative-valence 
signal. 


AGRP neurons condition place preference 


We also examined whether AGRP neurons influenced preference for 
contextual cues independently of ingestive behaviours. First, we per- 
formed place preference conditioning by inhibiting AGRP neurons in 
the absence of food (Fig. 2a). Using a two-sided chamber, food-restricted 
but not ad libitum fed AGRP^PSAM-GlyR mice increased occupancy 
time in the chamber paired with PSEM^89S, whereas control 
AGRP^eGFP food-restricted mice did not shift place preference 
(Fig. 2a–c). The extent of the shift in occupancy time for food-restricted 
mice positively correlated with the transduction efficiency 
of the PSAM^L141F-GlyR-IRES-eGFP transgene (Fig. 2d) as well as a 
post hoc food intake reduction test during AGRP neuron silencing 
(Fig. 2e). Furthermore, for a subset of mice that were subjected 
sequentially to both conditioned place preference and flavour pref- 
erence tests with AGRP neuron silencing, the magnitude of the pref- 
erence shift was correlated between the two conditioning assays 
(Fig. 2f). This demonstrates a conditioning process in which reduc- 
tion of electrical activity in this neuron population during energy 
deficit reduces food intake and also increases preference for associated 
contextual and flavour cues. 

We next investigated whether the negative-valence properties of 
AGRP neuron activation influenced conditioned place preference. 
Although passive conditioning to AGRP neuron stimulation was 
neither sufficient to reliably change place preference” (Δ occupancy 
time: 28.4 ± 72 s; P = 0.70 paired t-test, n = 10) nor to 
oppose cocaine-conditioned place preference (Δ preference, AGRP“*”/coc: 
25.2 ± 4.2%, n = 6; AGRP/coc: 32.7 ± 8.6%, n = 6; P = 
0.45), we sought to determine if the contrast between high and low 
AGRP neuron activity was learned more effectively. Following 
passive conditioning, mice were tested each day while freely explor- 
ing the conditioning apparatus, and AGRP neuron photostimula- 
tion was triggered whenever the mouse entered the side previously 
exposed to photostimulation (Methods and Fig. 2g). Over the 
course of multiple sessions, mice showed avoidance of the side 
paired with AGRP neuron photostimulation (Fig. 2h, i). This effect 
was more robust for the second half of the closed-loop place pref- 
erence session (Fig. 2j), which is probably related to the previously 
reported minutes-long latency of AGRP neuron stimulation to 
evoke feeding*. In a subsequent extinction test in the absence of 
photostimulation, mice preferred the side that had been associated 
with cessation of AGRP neuron photostimulation (Fig. 2k and 
Extended Data Fig. 3a, b). These experiments further indicate that 
AGRP neurons transmit a negative-valence signal, and discontinu- 
ing AGRP neuron photostimulation can condition preference for 
contextual cues. 
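
The closed-loop rule described above (photostimulation whenever the mouse is on the previously paired side) and the per cent occupancy measure can be illustrated with a short sketch. It is a simplified model under stated assumptions, not the acquisition code used in the study: position is reduced to one coordinate per video frame, the chamber is split at a hypothetical `midline`, and the function names are invented for the example.

```python
def closed_loop_session(x_positions, midline, stim_side="right"):
    """Return (laser_on, percent_occupancy) for one session.

    x_positions: one x coordinate per frame (assumed equal frame intervals).
    midline: x coordinate separating the two sides of the chamber.
    stim_side: side on which photostimulation is triggered.
    laser_on[i] is True when the animal is on the stimulated side in frame i.
    """
    if stim_side not in ("left", "right"):
        raise ValueError("stim_side must be 'left' or 'right'")
    laser_on = [
        (x > midline) if stim_side == "right" else (x < midline)
        for x in x_positions
    ]
    percent = 100.0 * sum(laser_on) / len(laser_on) if laser_on else 0.0
    return laser_on, percent


def second_half_occupancy(laser_on):
    """Per cent occupancy on the stimulated side in the second half of a session."""
    half = laser_on[len(laser_on) // 2:]
    return 100.0 * sum(half) / len(half) if half else 0.0


if __name__ == "__main__":
    laser, pct = closed_loop_session([1, 2, 6, 7, 8, 3, 2, 1], midline=5)
    print(pct, second_half_occupancy(laser))
```

Computing the measure separately for the second half of each session mirrors the comparison in Fig. 2i and 2j, where avoidance was more pronounced later in the session.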

The temporal properties and the magnitude of the response to 
passive place conditioning indicate that AGRP neuron activation is 
not a strongly aversive stimulus, such as a shock"®, that is capable of 
eliciting goal-directed instrumental avoidance responses. Indeed, 
attempts to condition mice to perform an instrumental action 
(lever-press or nose-poke) to either shut off AGRP neuron photosti- 
mulation in well-fed mice or to optogenetically silence AGRP neurons 
in food-restricted mice were unsuccessful (see Methods and Extended 
Data Fig. 3c-i). Therefore these experiments indicate that AGRP 
neuron activity is associated with a negative-valence signal that can 
mediate Pavlovian learning, but this property does not readily extend 
to instrumental conditioning. 
Figure 3 | Modulation of instrumental responding for food. a, Experimental 
design. AGRP^ChR2 mice were trained to lever press for food pellets. PR7 
reinforcement testing was performed over 15 sessions on two groups: food- 
restricted (black, n = 11) or ad libitum fed AGRP neuron photostimulated 
(cyan, n = 11). During test sessions (120 min), levers were available for the first 
40 min. The AGRP neuron photostimulated group received intracranial light 
pulses for the entire 120 min session. b, Lever presses in each session during PR7 
reinforcement. c-e, Lever presses (c), pellets earned (d) and breakpoint ratio 
(e) from first (1) and last (15) PR7 reinforcement test sessions. f, Representative 
traces of cumulative lever pressing during first (1) and last (15) sessions. g, Rate 
of lever pressing during first 10 min of session (low effort reinforcement, filled 
circles) and rest of session (high-effort reinforcement, open circles) on first (1) 
and last (15) PR7 session. NS, P > 0.05, *P < 0.05, **P < 0.01, ***P < 0.001. 
Values are means ± s.e.m. Statistical analysis in Extended Data Table 1. 


Modulation of instrumental food-seeking 


Although manipulation of AGRP neuron activity does not appear to 
directly reinforce instrumental responses, AGRP neuron activation 
leads to vigorous performance of previously learned instrumental 
food-seeking responses*®. We examined whether AGRP neuron- 
evoked instrumental responding for food may be sensitive to the 
negative valence of these neurons. If AGRP neuron activity influences 
food-seeking behaviours through a negative-valence signal (Extended 
Data Fig. 1b), then previously reinforced lever-pressing actions would 
gradually decrease during AGRP neuron photostimulation because 
nutrient ingestion would not be capable of reducing exogenously 
elevated AGRP neuron activity. As an alternative hypothesis, if 
AGRP neuron activity predominantly influenced food-seeking by 
enhancing the positive valence of food consumption (Extended 
Data Fig. 1a), then lever-pressing would remain elevated. 

We initially compared two groups of mice that were trained to lever- 
press on a progressive ratio 7 (PR7) food reinforcement schedule under 
food restriction (Fig. 3a). After learning the contingency between lever- 
pressing and food delivery, one group that was maintained under food 
restriction showed steady lever-press responses for 15 sessions. The 
other group was re-fed ad libitum and was tested for instrumental 
food-seeking during AGRP neuron photostimulation, in which 
AGRP neuron stimulation was continued after the levers and food 
access were withdrawn (Fig. 3a). Ad libitum fed mice initially 
responded to AGRP neuron photostimulation with high lever-press 
rate and food consumption, similar to the food-restricted mice (t-test, 
P = 0.49; Fig. 3b). In subsequent sessions, photostimulated ad libitum 
fed AGRP^ChR2 mice showed a progressive decline in lever presses, 
pellets consumed, and break point (Fig. 3b–e and Extended Data Fig. 
4a–c). Lever-pressing was reduced nearly to the low levels observed 
without photostimulation (Fig. 3b and Extended Data Fig. 4a–c), and 
pressing at high-effort response ratios was most strongly diminished 
(Fig. 3f, g). In a separate group, using a shorter photostimulation 
protocol, lever-pressing was reduced to an intermediate level 
(Extended Data Fig. 4d–f). We noted an increase in body weight 
during the multi-session AGRP neuron stimulation protocol, but 
suppression of lever pressing for food was not due to these long-term 
metabolic changes (Extended Data Fig. 5). Furthermore, ad libitum 
food intake during AGRP neuron photostimulation was not altered 
with the extended stimulation protocol, indicating that reduced 
instrumental food-seeking was not due to food aversion or diminished 
effectiveness of repeated AGRP neuron photostimulation 
(Extended Data Fig. 6). Taken together, progressive reduction of 
AGRP neuron-evoked instrumental food-seeking responses in ad 
libitum mice is consistent with the negative-valence properties of 
AGRP neurons and indicates reduced value of nutritive food when 
AGRP neuron activity remains elevated. 

Figure 4 | Food rapidly reduces AGRP neuron activity. a, b, Configuration 
for deep-brain calcium imaging from AGRP neurons in freely moving mice. 
c, d, Image of AGRP^GCaMP6f neurons (c) by deep-brain calcium imaging and their 
region of interest (ROI) spatial filters (d) for image analysis. Scale bar, 15 µm. 
e, Change in baseline GCaMP6 fluorescence for neurons in mice under ad libitum 
fed and food-restricted conditions (61 neurons, 4 mice). f, From food-restricted 
mice, GCaMP6f fluorescence traces from subset of individual neurons in 
c, d, during chow pellet food consumption. Black line, food delivery. Blue bars, 
food consumption. g, Normalized Ca2+ responses of AGRP neurons (99 neurons, 
4 food-restricted mice) during exposure to a chow food pellet (left) and a false food 
pellet (right). Black lines, chow/false food delivery. Red lines, first contact with 
chow/false food. h, Mean calcium responses to chow food and false food aligned to 
delivery time (99 neurons, 4 food-restricted mice). Shading, s.e.m. i, Change in 
normalized GCaMP6 fluorescence comparing initial baseline activity, first food 
exposure, and after consuming to satiety (110 neurons, 4 food-restricted mice). 
j, GCaMP6f fluorescence traces from 2 example neurons (2 mice) during short 
trials of food (top) and false food (bottom) delivery. k, Mean GCaMP6 
fluorescence responses from individual mice to chow food exposure aligned with 
food delivery (left) and food contact (right). l, Mean GCaMP6 fluorescence 
responses before (black) and after (red) cued Pavlovian trace conditioning (before, 
60 neurons; after, 65 neurons; 3 mice). Black and red bars, range for first lick of 
liquid food. Shading, s.e.m. ***P < 0.001. Values are means ± s.e.m. 


Food rapidly inhibits AGRP neurons 


AGRP neuron electrical activity manipulations condition learning, but 
an essential consideration is the correspondence of perturbation studies 
to the endogenous activity patterns of AGRP neurons during feeding 
behaviours. To investigate this, we monitored AGRP neuron activity in 
freely moving mice using deep-brain imaging of genetically encoded 
calcium indicators through an intracranial gradient index (GRIN) lens 
with a head-mounted miniature microscope” (Fig. 4a, b). 

Genetically encoded calcium indicators (GCaMP6f or GCaMP6s”’) 
were expressed in AGRP neurons (Fig. 4c, d) and were well tolerated 
(Extended Data Fig. 7a). Characterization in brain slices revealed 
sharp increases in calcium activity during burst firing, while changes 
in tonic firing were detected as a gradual change in the baseline fluor- 
escence (Extended Data Fig. 7b). Characterization in vivo by injection 
of the orexigenic hormone ghrelin substantially increased GCaMP6 
brightness and dynamic responses in 81% of AGRP neurons 
(Extended Data Fig. 7c-e and Supplementary Video 1, 4% of AGRP 
neurons decreased), which subsequently returned to baseline levels 
(population t1/2: 19 min, individual neuron t1/2 range: 5–46 min, 
Extended Data Fig. 7f, g). These responses are consistent with prev- 
iously reported electrical activity changes ex vivo’. Therefore, imaging 
AGRP neuron calcium dynamics allows individual neuron activity 
patterns to be monitored in vivo. 

We used deep-brain calcium imaging to monitor AGRP neuron 
activity in food-restricted mice, which was elevated over the ad libitum 
fed condition in 54/61 AGRP neurons (Fig. 4e). Delivery of a mouse 
chow pellet to food-restricted mice resulted in rapid reduction of 
GCaMP6 fluorescence during food consumption in 106/110 neurons 
(96% of neurons, n = 4 mice, Fig. 4f-i and Supplementary Video 2); 
1/110 neurons increased fluorescence. Removal of the food, after less 
than 50 mg had been consumed, was followed by a gradual increase in 
AGRP neuron calcium activity to a level that remained slightly below the 
initial baseline value (Fig. 4j). In contrast, a false-food pellet (for 
example, wood block) only transiently reduced AGRP neuron activity, 
which rapidly recovered after the mouse contacted the object (Fig. 4g, h). 
Multiple trials with a short exposure to food led to progressive decline in 
baseline fluorescence that was significantly larger than for the false-food 
object (Fig. 4j and Extended Data Fig. 7h). These experiments show that 
in food-restricted mice, baseline AGRP neuron activity was gradually 
reduced by the consumption of nutritive food, in line with homeostatic 
regulation, but AGRP neuron activity was also rapidly and strongly 
suppressed during initiation of food consumption behaviours. 

Further analysis of the rapid response to nutritive chow food 
pellets revealed that calcium activity was reduced before food 
consumption (Fig. 4k). Moreover, presentation of a visible but 
inaccessible food pellet reduced AGRP neuron activity nearly to 
the same level as during a subsequent food consumption trial 
(Extended Data Fig. 7i and Supplementary Video 3). These obser- 
vations demonstrate that AGRP neuron activity is inhibited by food- 
related cues before nutrients are tasted or consumed. To examine if this 
rapid AGRP neuron inhibition involves learning, we used Pavlovian trace 
conditioning to determine the capability of an initially neutral conditioned 
stimulus to modulate AGRP neuron activity. Initial exposure to a 200 ms 
auditory and visual compound conditioned stimulus showed a slight 
increase of mean AGRP neuron calcium activity (Fig. 4l), but 
GCaMP6 fluorescence was reduced just prior to consumption of a palat- 
able liquid food delivered by subsequent presentation of a lick spout. 
After repeatedly pairing the conditioned stimulus with food presentation, 
the conditioned stimulus elicited reduction of AGRP neuron activity 
(before spout extension, P < 0.001, unpaired t-test), and food consump- 
tion had little additional effect (Fig. 4l). Therefore, AGRP neurons pre- 
dictively encode the receipt of nutritive food by rapidly reducing 
activity, and this process involves learning. Together with neuron silen- 
cing and activation experiments (Figs 1-3), these studies demonstrate 
that endogenous AGRP neuron dynamics during food consumption 
correspond to the activity manipulations that conditioned preference 
for flavour and contextual cues. Moreover, the rapid recovery of AGRP 
neuron activity during false food exposure is expected to reduce pref- 
erence for non-food objects because AGRP neuron activity signals 
negative valence. 


A virtual thirst state is avoided 


Finally, we explored the possibility that a negative-valence signal was used 
by other homeostatic neurons that mediate a different survival need. To 
investigate this, we developed an animal model of evoked-thirst by che- 
mogenetic and optogenetic induction of water-seeking and consumption. 
Prior work has shown the importance of the subfornical organ (SFO) in 
the brain for mediating water intake (refs 28, 29). Chemogenetic subfornical organ 
activation selectively elevated consumption of water, but not food, and 
increased breakpoint for water on a progressive ratio 3 schedule 
(Extended Data Fig. 8a-i). Elevated drinking was observed by optogen- 
etically activating a subfornical organ neuron subpopulation molecularly 
defined by expression of nitric oxide synthase 1 (Nos1) (Fig. 5a-c). 
SFO^NOS1-ChR2 photostimulation rapidly increased water consumption 
(latency, 20 Hz: 3.8 ± 0.5 min, n = 8) at a range of photostimulation 
frequencies (Fig. 5d) but not food intake (Extended Data Fig. 8j). 

Figure 5 | Virtual dehydration state is avoided. a, Optical fibre position over 
SFO (box). b, Expression of ChR2-eYFP in SFO^NOS1 neurons. Scale bar, 1 mm. 
c, Fos immunofluorescence following photostimulation in SFO^NOS1-ChR2 mice. 
Scale bar, 100 µm. d, Water consumption by SFO^NOS1-ChR2 mice either before or 
during photostimulation (1 h) at different frequencies (n = 8). e, f, Closed-loop 
place preference for SFO^NOS1-ChR2 mice (filled circles, n = 12) and untransfected 
controls (open circles, n = 6) as in Fig. 3e. Blue bar, photostimulated side (group: 
F(1,16) = 26.2, P < 0.001; session: F(7,112) = 0.74, P = 0.64; interaction: F(7,112) = 
4.25, P < 0.001). g, Change in occupancy time during an extinction session for the 
photostimulated side for SFO^NOS1-ChR2 mice (n = 12) and untransfected controls 
(n = 6). *P < 0.05, **P < 0.01, ***P < 0.001. Values are means ± s.e.m. 

To examine the conditioning properties of SFO^NOS1 neurons, we used 
the same closed-loop place conditioning protocol as for AGRP neurons 
(see Fig. 2g). Over the course of 7 sessions, mice showed avoidance of the 
side paired with SFO^NOS1 neuron photostimulation (Fig. 5e, f). In a 
subsequent extinction test in the absence of photostimulation, mice 
preferred the side associated with cessation of SFO^NOS1 neuron photo- 
stimulation (Fig. 5g). These experiments show that places associated 
with SFO^NOS1 neuron activation are avoided, which demonstrates a 
negative-valence signal that can condition learning from a second 
homeostatic neuronal cell type with a distinct behavioural function. 


Discussion 


Physiological need states, in part acting through circulating hor- 
mones, lead to elevated electrical activity of specialized need-sensitive 
neurons, such as AGRP and SFO^NOS1 neurons. Here we show that 
these molecularly defined neuron populations signal negative valence. 
Negative-valence physiological states impose a cost on inaction or 
actions that fail to reduce the need state. Furthermore, through the 
reduction of negative-valence signals, preference for cues associated 
with alleviating physiological need states can be learned. 
Correspondingly, deep-brain calcium imaging demonstrated that 
AGRP neuron activity is rapidly inhibited during both food consumption 
and by cues that predict food. Recent measurement of mean population 
activity in AGRP neurons also found fast inhibitory dynamics (ref. 30), and we 
show that nearly all AGRP neurons have this property. We also find that 
rapid reduction of AGRP neuron activity involves the learned association 
of sensory information with food consumption, highlighting the exist- 
ence of neural circuit inputs carrying information about conditioned 
stimuli. Moreover, sustained reduction of AGRP neuron activity requires 
nutrient ingestion, consistent with homeostatic regulation, probably 
involving well-established hormonal control mechanisms. 

The valence of increased AGRP neuron activity is opposite to 
analogous neuronal perturbations in the lateral hypothalamus, 
which also lead to avid food consumption but exhibit rewarding 
properties (refs 17, 18, 31, 32). This may reflect mechanistic differences 
between homeostatic and hedonic motivation for food, which, for 
the latter, is primarily distinguished by appetite for highly palatable 
food even in the absence of a need state (ref. 33). Our experiments, taken 
together with other studies, indicate that homeostatic need states 
regulate behaviour through a combination of negative- and positive-valence 
signals contributing to Pavlovian and instrumental conditioning. 
Modulation of both processes can be coordinated by 
hormones such as ghrelin, leptin, and angiotensin (refs 34-36), as well as 
synaptic inputs (refs 37, 38). Under homeostatic deficit, negative and positive 
reinforcement processes are expected to operate in a concerted 
push-pull manner, respectively, to achieve outcomes that have the 
highest value in that physiological state. 

The negative valence of elevated AGRP and SFO^NOS1 neuron activity 
is also consistent with human self-reports of negative feelings 
associated with hunger and thirst arising from homeostatic deficits 
(refs 20, 39). The behavioural characteristics of AGRP neuron activity in 
mice parallel some negative emotional aspects of weight loss in 
humans, which contribute to low long-term behavioural compliance 
on weight-loss diets (refs 19, 21). The failure to maintain weight loss reverses 
the multifaceted clinical benefits of reduced body weight, such as 
lessening of diabetes and hypertension symptoms. Our experiments 
show that AGRP neuron circuits, which are conserved in humans, 
provide an entry point to investigate the relationship between meta- 
bolism and negative emotional states. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 14 July 2014; accepted 19 March 2015. 
Published online 27 April 2015. 


1. van den Top, M., Lee, K., Whyment, A. D., Blanks, A. M. & Spanswick, D. Orexigen- 
sensitive NPY/AgRP pacemaker neurons in the hypothalamic arcuate nucleus. 
Nature Neurosci. 7, 493-494 (2004). 

2. Krashes, M. J. et al. Rapid, reversible activation of AgRP neurons drives feeding 
behavior in mice. J. Clin. Invest. 121, 1424-1428 (2011). 

3. Luquet, S., Perez, F. A., Hnasko, T. S. & Palmiter, R. D. NPY/AgRP neurons are 
essential for feeding in adult mice but can be ablated in neonates. Science 310, 
683-685 (2005). 

4. Aponte, Y., Atasoy, D. & Sternson, S. M. AGRP neurons are sufficient to orchestrate 
feeding behavior rapidly and without training. Nature Neurosci. 14, 351-355 
(2011). 

5. Atasoy, D., Betley, J. N., Su, H. H. & Sternson, S. M. Deconstruction of a neural circuit 
for hunger. Nature 488, 172-177 (2012). 

6. Sternson, S. M. Hypothalamic survival circuits: blueprints for purposive behaviors. 
Neuron 77, 810-824 (2013). 

7. Yiin, Y. M., Ackroff, K. & Sclafani, A. Flavor preferences conditioned by intragastric 
nutrient infusions in food restricted and free-feeding rats. Physiol. Behav. 84, 
217-231 (2005). 

8. Dickinson, A. & Balleine, B. in Stevens’ Handbook of Experimental Psychology Vol. 3 
(ed. R. Gallistel) Ch. 12 497-533 (John Wiley & Sons, 2002). 

9. Rescorla, R. A. & Solomon, R. L. Two-process learning theory: relationships 
between Pavlovian conditioning and instrumental learning. Psychol. Rev. 74, 
151-182 (1967). 

10. Mackintosh, N. J. Conditioning and Associative Learning (Oxford Univ. Press, 1983). 

11. Berridge, K. C. Motivation concepts in behavioral neuroscience. Physiol. Behav. 81, 
179-209 (2004). 

12. Cabanac, M. Physiological role of pleasure. Science 173, 1103-1107 (1971). 

13. Robinson, M. J. & Berridge, K. C. Instant transformation of learned repulsion into 
motivational “wanting”. Curr. Biol. 23, 282-289 (2013). 

14. Hull, C. L. Principles of Behavior (D. Appleton-Century Co., 1943). 



15. Navratilova, E. et al. Pain relief produces negative reinforcement through activation 
of mesolimbic reward-valuation circuitry. Proc. Natl Acad. Sci. USA 109, 
20709-20713 (2012). 

16. Sidman, M. Avoidance conditioning with brief shock and no exteroceptive warning 
signal. Science 118, 157-158 (1953). 

17. Margules, D. L. & Olds, J. Identical “feeding” and “rewarding” systems in the lateral 
hypothalamus of rats. Science 135, 374-375 (1962). 

18. Jennings, J. H., Rizzi, G., Stamatakis, A. M., Ung, R. L. & Stuber, G. D. The inhibitory 
circuit architecture of the lateral hypothalamus orchestrates feeding. Science 341, 
1517-1521 (2013). 

19. Stunkard, A. J. & Rush, J. Dieting and depression reexamined. A critical review of 
reports of untoward responses during weight reduction for obesity. Ann. Intern. 
Med. 81, 526-533 (1974). 

20. Keys, A., Brozek, J., Henshel, A., Mickelsen, O. & Taylor, H. L. The Biology of Human 
Starvation Vol. II (Univ. of Minnesota Press, 1950). 

21. Wadden, T. A., Stunkard, A. J. & Smoller, J. W. Dieting and depression: a 
methodological study. J. Consult. Clin. Psychol. 54, 869-871 (1986). 

22. Takahashi, K. A. & Cone, R. D. Fasting induces a large, leptin-dependent increase in 
the intrinsic action potential frequency of orexigenic arcuate nucleus 
neuropeptide Y/Agouti-related protein neurons. Endocrinology 146, 1043-1047 
(2005). 

23. Betley, J. N., Cao, Z. F., Ritola, K. D. & Sternson, S. M. Parallel, redundant circuit 
organization for homeostatic control of feeding behavior. Cell 155, 1337-1350 
(2013). 

24. Magnus, C. J. et al. Chemical and genetic engineering of selective ion channel- 
ligand interactions. Science 333, 1292-1296 (2011). 

25. Cravens, R. W. & Renner, K. E. Conditioned hunger. J. Exp. Psychol. 81, 312-316 
(1969). 

26. Ghosh, K. K. et al. Miniaturized integration of a fluorescence microscope. Nature 
Methods 8, 871-878 (2011). 

27. Chen, T. W. et al. Ultrasensitive fluorescent proteins for imaging neuronal activity. 
Nature 499, 295-300 (2013). 

28. Johnson, R. F., Beltz, T. G., Thunhorst, R. L. & Johnson, A. K. Investigations on the 
physiological controls of water and saline intake in C57BL/6 mice. Am. J. Physiol. 
Regul. Integr. Comp. Physiol. 285, R394-R403 (2003). 

29. Oka, Y., Ye, M. & Zuker, C. S. Thirst driving and suppressing signals encoded by 
distinct neural populations in the brain. Nature 520, 349-352 (2015). 

30. Chen, Y., Lin, Y. C., Kuo, T. W. & Knight, Z. A. Sensory detection of food rapidly 
modulates arcuate feeding circuits. Cell 160, 829-841 (2015). 

31. Olds, J. in Brain Stimulation and Motivation: Research and Commentary (ed. E. S. 
Valenstein) 81-99 (Scott, Foresman and Company, 1973). 

32. Jennings, J. H. et al. Visualizing hypothalamic network dynamics for appetitive and 
consummatory behaviors. Cell 160, 516-527 (2015). 

33. Saper, C. B., Chou, T. C. & Elmquist, J. K. The need to feed: homeostatic and hedonic 
control of eating. Neuron 36, 199-211 (2002). 

34. Domingos, A. I. et al. Leptin regulates the reward value of nutrient. Nature Neurosci. 
14, 1562-1568 (2011). 

35. Egecioglu, E. et al. Ghrelin increases intake of rewarding food in rodents. Addict. Biol. 
15, 304-311 (2010). 

36. Wang, Q. et al. Arcuate AgRP neurons mediate orexigenic and glucoregulatory 
actions of ghrelin. Mol. Metabol. 3, 64-72 (2014). 

37. Yang, Y., Atasoy, D., Su, H. H. & Sternson, S. M. Hunger states switch a flip-flop 
memory circuit via a synaptic AMPK-dependent positive feedback loop. Cell 146, 
992-1003 (2011). 

38. Krashes, M. J. et al. An excitatory paraventricular nucleus to AgRP neuron circuit 
that drives hunger. Nature 507, 238-242 (2014). 

39. Rolls, B. J. et al. Thirst following water deprivation in humans. Am. J. Physiol. 239, 
R476-R482 (1980). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements This research was funded by the Howard Hughes Medical 
Institute. Z.F.H.C. was funded by the HHMI Janelia Farm Graduate Scholar program. We 
thank B. Balleine, M. Schnitzer, N. Ji, A. Lee, Z. Guo for suggestions on experimental 
design; H. Su for molecular biology; J. Rouchard, S. Lindo, K. Morris, M. McManus for 
mouse breeding and procedures; M. Copeland for histology; J. Osborne and C. Werner 
for apparatus design; K. Branson for automated mouse tracking software (Ctrax); 
J. Dudman, S. Eddy, H. Grill, N. Geary, U. Heberlein for comments on the manuscript. 


Author Contributions J.N.B. and S.M.S. initiated the project. J.N.B., Z.F.H.C., S.X. and 
S.M.S. prepared the manuscript with comments from all authors. J.N.B., S.X., Z.F.H.C., 
R.G., C.J.M. and S.M.S. designed the experiments and analysed the data. J.N.B. and 
Z.F.H.C. performed conditioning experiments, S.X. performed in vivo calcium imaging 
experiments, R.G. and C.J.M. developed the SFO activation model for evoked water 
drinking, Y.Y. helped with image registration. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to S.M.S. (sternsons@janelia.hhmi.org). 


14 MAY 2015 | VOL 521 | NATURE | 185 


©2015 Macmillan Publishers Limited. All rights reserved 




METHODS 


All experimental protocols were conducted according to US National Institutes of 
Health guidelines for animal research and approved by the Institutional Animal 
Care and Use Committee at Janelia Farm Research Campus. 

Mice. Mice were housed on a 06:00 to 18:00 light cycle with water and mouse chow 
ad libitum (PicoLab Rodent Diet 20, 5053 tablet, TestDiet) unless otherwise noted. 
Adult male mice (>8 weeks old) were used for experiments. Cre recombinase- 
expressing lines were used: Agrp-IRES-Cre (Jackson Labs stock 012899, 
Agrp^tm1(cre)Lowl/J), Nos1-IRES-Cre (Jackson Labs stock 017526, B6.129- 
Nos1^tm1(cre)Mgmj/J). For channelrhodopsin-2 expression in AGRP neurons, Agrp- 
IRES-Cre mice were crossed with Ai32: ROSA26-loxStoplox-ChR2-eYFP (Jackson 
Labs stock 012569, B6;129S-Gt(ROSA)26Sor^tm32(CAG-COP4*H134R/EYFP)Hze/J). For 
eGFP in AGRP neurons, Agrp-IRES-Cre mice were crossed with ROSA-GNZ: 
ROSA26-loxStoplox-GFP-NLS-LacZ (Jackson Labs stock 008516). For Arch in 
AGRP neurons, Agrp-IRES-Cre mice were crossed with Ai35d: ROSA-CAG- 
loxStoplox-Arch-GFP-WPRE (Jackson Labs stock 012735). C57BL/6J mice were 
from Jackson Labs (stock 000664). 

Recombinant adeno-associated viral (rAAV) vectors. The following Cre- 
dependent viral vectors*”*' were used in this study: rAAV2/10-CAG-FLEX-rev- 
ChR2tdtomato (3e13 Genomic Copies (GC) per ml, University of Pennsylvania 
vector core), rAAV2/1 and rAAV2/9-hSyn-FLEX-rev-PSAM^L141F-GlyR-IRES- 
eGFP (1.4e13 and 1.5e13 GC per ml, respectively, Janelia, http://www.addgene.org/ 
Scott_Sternson/), rAAV2/9-CAG-FLEX-eGFP (7e12 GC per ml, Penn), rAAV2/ 
1-hSyn-Cre (5.6e12 GC per ml, Janelia), rAAV2/2-Ef1a-DIO-hChR2(H134R)- 
EYFP (6e12 GC per ml, UNC vector core), rAAV2/2-Ef1a-DIO-hM3D(Gq)- 
mCherry (3e12 GC per ml, UNC), AAV2/1-Syn-FLEX-GCaMP6f and AAV2/ 
1-Syn-FLEX-GCaMP6 s (10’” to 10'* GC per ml, Janelia). CAG, promoter containing 
a cytomegalovirus enhancer; the promoter, first exon and first intron of the chicken 
beta actin gene; and the splice acceptor of rabbit beta-globin gene. Ef1a, human 
elongation factor-1 alpha promoter. FLEX, Cre-dependent flip-excision switch. DIO, 
double-floxed inverted orientation. 

Viral injections and optical fibre placement. Viral injections and implantation 
of ferrule-capped optical fibres (200 µm diameter core, multimode, NA 0.48, 
ThorLabs) were performed as described previously. Bilateral ARC viral injections 
in Agrp-IRES-Cre mice were made at two depths using the following coordinates: 
bregma: −1.3 mm; midline: ±0.3 mm; dorsal surface: −5.95 mm and 
−5.85 mm (250-500 nl per site). Bilateral SFO viral injections were made at 
bregma: −0.35 mm; midline: ±0.6 mm; dorsal surface: −2.45 mm (100-300 
nl per site). For deep-brain calcium imaging, unilateral viral injections were made 
at bregma: −1.46 mm; midline: 0.3 mm; dorsal surface: −6.0 mm and −5.85 mm 
(100 nl per site). 

After 2 to 4 weeks for transgene expression, a ferrule-capped optical fibre was 
placed for AGRP^ChR2 mice over the ARC (bregma: −1.4 mm; midline: +0.25 mm; 
dorsal surface: −5.6 mm) and for SFO^NOS1-ChR2 mice over the SFO (bregma: −0.35 
mm; midline: +0.6 mm; dorsal surface: −2.3 mm; approach angle: 12°). 
Immunohistochemistry. Immunohistochemistry was as described previously”’. 
Antibodies: goat anti-AGRP (1:5,000, Neuromics, GT15023), guinea pig anti- 
RFP (1:25,000, Covance), rabbit anti-Fos (1:5,000, Santa Cruz, SC-52, Lot- 
C1010), rabbit anti-GFP (1:5,000, Invitrogen, A-11122). Confocal images 
(Zeiss LSM 510 microscope) were acquired first from Fos immunostained tissue 
taken from a food deprived mouse, as described previously”. These image 
acquisition settings were maintained for all quantitative Fos analysis (each con- 
dition: >50 nuclei from 3 mice, selected blind to the Fos immunofluorescence 
levels). Transgene transduction efficiency in AGRP neurons was determined as 
previously described” (from >500 AGRP boutons, multiple sections). 

Food restriction. Food intake was adjusted to maintain mice at 85-90% of their 
initial ad libitum fed body weight, and food was consumed 18 h before subsequent 
behavioural assays. 
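
The 85-90% body-weight criterion is simple arithmetic and can be checked in a few lines. The following is a minimal sketch, not the authors' procedure; the function names and the example weights are hypothetical.

```python
def restriction_window(initial_weight_g, low=0.85, high=0.90):
    """Return the (min, max) target body weight in grams.

    low and high are the 85-90% bounds quoted in the text.
    """
    return initial_weight_g * low, initial_weight_g * high


def in_window(current_weight_g, initial_weight_g):
    """True if the current weight lies inside the restriction window."""
    lo, hi = restriction_window(initial_weight_g)
    return lo <= current_weight_g <= hi


# Hypothetical mouse with a 30 g ad libitum fed body weight.
print(restriction_window(30.0))
print(in_window(26.0, 30.0))
```

A daily weight that falls outside the window would indicate that the food ration needs adjusting.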

AGRP neuron inhibition in vivo. For AGRP neuron silencing, mice expressing 
PSAM^L141F-GlyR in AGRP neurons (AGRP^PSAM-GlyR mice) were injected intra- 
peritoneally (i.p.) with the ligand PSEM^89S (30 mg per kg) dissolved in saline. 

Suppression of refeeding in food-restricted mice. In food-restricted 
AGRP^PSAM-GlyR mice, two 1 h food intake measurements were performed early 
in the light period and separated by a day. Saline or PSEM^89S (30 mg per kg) was 
administered 30 min before food was provided, at the time food was provided, 
and 30 min later (multiple administrations of PSEM^89S were used because of its rapid 
clearance). 

Suppression of Fos expression by AGRP neuron silencing. In ad libitum fed mice, 
Fos immunofluorescence intensity was measured in AGRP neurons from 
AGRP^PSAM-GlyR (n = 3) and AGRP^eGFP (n = 3) mice during the dark period after 
PSEM^89S treatment (30 mg per kg). Fos expression in AGRP neurons is elevated 
in the dark period in the absence of food, which was removed from the mice at the 
beginning of the dark period (18:00). PSEM^89S (30 mg per kg) was injected 30 min 
before the onset of the dark period and subsequently every 45 min for 5 h 
(injection frequency is due to the pharmacokinetics of PSEM^89S). Mice were deeply 
anaesthetized and then perfused, and the brain was dissected for immunohisto- 
chemical analysis. 

Suppression of dark period feeding. Ad libitum fed mice were injected with 
either saline (test day 1) or PSEM^89S (test day 2). At the onset of the dark period, 
mice were injected again and given free access to chow. PSEM^89S (30 mg per kg) or 
saline was administered every hour until the assay concluded at 22:00; food 
consumption was recorded. 

Photostimulation in vivo. Photostimulation was as described previously’. Light 
pulse protocol: 10 ms pulses, 20 Hz (unless otherwise noted) for 1 s, repeated 
every 4 s. 
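
The light pulse protocol can be written out as a list of pulse onset times. This sketch assumes that "repeated every 4 s" refers to the interval between the starts of successive 1 s bursts; the function names and arguments are illustrative and are not taken from the original methods.

```python
def pulse_onsets(duration_s, freq_hz=20.0, burst_s=1.0, period_s=4.0):
    """Onset times (s) of light pulses for a repeating burst protocol."""
    pulses_per_burst = int(round(freq_hz * burst_s))
    onsets = []
    burst_index = 0
    while burst_index * period_s < duration_s:
        burst_start = burst_index * period_s
        for i in range(pulses_per_burst):
            t = burst_start + i / freq_hz
            if t < duration_s:
                onsets.append(t)
        burst_index += 1
    return onsets


def duty_cycle(pulse_width_s=0.010, freq_hz=20.0, burst_s=1.0, period_s=4.0):
    """Fraction of time the light is on, averaged over one burst period."""
    return pulse_width_s * freq_hz * burst_s / period_s


onsets = pulse_onsets(8.0)   # two burst periods
print(len(onsets))
print(duty_cycle())
```

Under this reading, each 4 s period contains 20 pulses of 10 ms, so the light is on for 5% of the time.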

Conditioned flavour preference. Food-restricted AGRP^PSAM-GlyR mice and 
AGRP^eGFP control mice were acclimatized for four sessions (15 min) to consump- 
tion of two non-nutritive gels that were sweetened with sucralose but differed by 
flavour (orange and strawberry). Prepackaged Hunts sugar-free Juicy Gels were 
used that contained 0.05 kcal per g (for comparison, 4.1 kcal per g in chow). 
Consumption during the last two sessions was used to determine initial flavour 
preference (no significant initial group flavour preference, P = 0.41, t-test). For 
flavour preference conditioning, mice were given two daily 30 min sessions 
(repeated over 4 days) with each gel individually, separated by 4 h, with the order 
of conditioning for the gels inverted each day. The initially preferred flavoured gel 
was presented paired with saline injection, and the less preferred flavoured gel was 
paired with injection of PSEM^89S (30 mg per kg) (each injection after 5 min of 
consumption). After conditioning, equal quantities of the two gels were presented 
(15 min) and the amount of each flavour consumed was recorded. This was 
repeated the following day with the position of the gels inverted, and preferences 
from the two test sessions were averaged. 
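
One way to express the test result as a single number is the fraction of total gel intake that came from the conditioned flavour, averaged over the two test sessions. The formula below is an assumption made for illustration; the text states only that preferences from the two sessions were averaged, and the intake values are hypothetical.

```python
def preference(paired_g, unpaired_g):
    """Fraction of total intake taken from the conditioned (paired) flavour."""
    total = paired_g + unpaired_g
    if total == 0:
        raise ValueError("no gel consumed in this session")
    return paired_g / total


def mean_preference(sessions):
    """Average preference over sessions given as (paired_g, unpaired_g) tuples."""
    values = [preference(p, u) for p, u in sessions]
    return sum(values) / len(values)


# Hypothetical intake (g) in the two 15 min test sessions.
print(mean_preference([(0.6, 0.4), (0.5, 0.5)]))
```

A value above 0.5 would indicate a preference for the conditioned flavour; averaging across the two sessions offsets any bias from gel position.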

To examine conditioned flavour preference learning during AGRP neuron 
activation, AGRP^ChR2 mice and AGRP^eGFP control mice implanted with a 
ferrule-capped optical fibre over the ARC were used. Mice were conditioned as 
above with photostimulation, with the following differences. Ad libitum fed mice 
were used and were acclimatized to consume the two non-nutritive gels overnight 
(3 g). For conditioning, in one session, the mouse was presented with 0.3 g of its 
preferred flavoured gel and after 5 min, intracranial light pulses were applied for 
an additional 25 min. In the other session, the mouse was presented with 0.3 g of 
the less preferred flavoured gel, and the mouse was kept in the cage for 30 min 
without any light applied to the fibre. Mice typically ate the entire 0.3 g during 
each session. After conditioning, the ad libitum fed mice were presented again 
with equal quantities of the two gels for two 15 min test sessions as described 
above. 

Conditioned taste aversion. AGRP^ChR2 mice with implanted ferrule-capped 
optical fibres were used for all groups (LiCl, saline, photostimulation, no photo- 
stimulation; all n = 6 mice). Mice were placed on water-restriction. After accli- 
mation to sipper tubes, the four groups were allowed to consume a tastant (0.15% 
saccharin solution, 20 min), and the amount was recorded. For LiCl and saline 
groups, mice were injected with either LiCl (125 mg per kg) or saline (0.9%) 
immediately following the exposure to the tastant. After 48 h, consumption of 
tastant was tested (20 min). On the next day, consumption of water (20 min) was 
measured. For the photostimulation group, mice received AGRP neuron photo- 
stimulation for 120 min immediately following exposure to tastant. The no 
photostimulation group was tethered with a fibre for the same period of time, 
but no light pulses were delivered. On the next day, consumption of tastant 
solution (20 min) was measured, followed by further photostimulation condi- 
tioning (total: 4 conditioning and 4 test sessions). The day after the last test 
session (test 4), consumption of water (20 min) was measured. 

Conditioned place preference. A two chambered apparatus was used with visual 
(black and white sides) and textural cues (black side: plastic grid (3 mm holes) 
flooring, white side: soft textured side of Kimtech bench-top-protector #7546 
Kimberly-Clark). The floor was back-lit (luminance ~100 lux), and an overhead 
video camera recorded position (Basler, 3.75 Hz frame rate, gVision software, 
http://gvision-hhmi.sourceforge.net/). The apparatus was in a sound isolation 
chamber. Ad libitum fed AGRP^PSAM-GlyR mice and AGRP^eGFP controls were 
acclimatized (15 min). The following day, the mouse’s position was recorded 
(1,800 s) and tracked offline using Ctrax, and the initial side preference was 
determined. Mice were then food restricted (FR) to 85-90% of their initial body 
weight before conditioning sessions. Daily conditioning consisted of two 1,800 s 
sessions: (1) the initially preferred side of the chamber paired with saline injec- 
tion; (2) the initially less preferred side paired with PSEM^89S (30 mg per kg) 
injection. After five conditioning days, mice were ad libitum re-fed. The following 
day, mice were given free access to the entire apparatus and their position was 
tracked. The change in occupancy is the change in time spent on the initially less 
preferred side following training, for which positive numbers reflect increased 
preference for the side on which they were injected with PSEM^89S. An additional 
set of AGRP^PSAM-GlyR mice was tested on the above protocol with the following 
modification: before the daily conditioning sessions, mice were given free access 
to mouse chow for 2.5 h, during which time they consumed 2-3 g. 
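
The change in occupancy defined above can be computed from tracked positions. This is a minimal sketch that assumes one side label per video frame and the 3.75 Hz frame rate quoted for the camera; the labels, function names and example data are hypothetical, and the input is not the Ctrax output format.

```python
FRAME_RATE_HZ = 3.75  # camera frame rate quoted in the text


def time_on_side(frame_sides, side, frame_rate_hz=FRAME_RATE_HZ):
    """Seconds spent on one side, given one side label per frame."""
    frames = sum(1 for s in frame_sides if s == side)
    return frames / frame_rate_hz


def occupancy_change(pre_sides, post_sides, conditioned_side):
    """Post-training minus pre-training time on the conditioned side (s)."""
    return (time_on_side(post_sides, conditioned_side)
            - time_on_side(pre_sides, conditioned_side))


# Hypothetical short tracks; a real session has 1,800 s of frames.
pre = ["black"] * 30 + ["white"] * 15
post = ["black"] * 15 + ["white"] * 30
print(occupancy_change(pre, post, "white"))
```

A positive result means the mouse spent more time on the conditioned side after training than before.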
Closed-loop place preference. Conditioning for AGRP^ChR2 optogenetic neuron 
photostimulation was performed with a two-step protocol alternating passive 
conditioning with a closed-loop place preference test. The closed-loop place 
preference protocol alone was less reliable for conditioning avoidance, likely 
related to the minutes-long latency* of AGRP neuron activation to induce 
food-seeking and consumption. Therefore, some prior experience with prolonged 
AGRP neuron activity on one of the sides appears to improve efficacy, although 
this exposure alone did not significantly alter place preference. Passive condition- 
ing sessions involved separate exposure to each side of the apparatus (1,800 s each) 
with the mouse tethered to the fibre, where the initially more preferred side was 
paired with intracranial light pulses to photostimulate AGRP neurons. Passive 
conditioning was followed by closed-loop place preference testing on the sub- 
sequent day, in which mice were allowed free access to both sides of the chamber 
and photostimulation was applied when the mouse entered the side of the chamber 
also paired with photostimulation during passive conditioning (photostimulation 
ceased as soon as the mouse crossed to the other side). After 7 conditioning-days 
(morning, closed-loop place preference; afternoon, passive conditioning), the 
mice were given free access to the chamber for 1,800 s without photostimulation 
(extinction test). AGRP^eGFP control mice were treated identically. The same pro- 
tocol was used for conditioning SFO^NOS1-ChR2 mice (Fig. 5). 
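
The closed-loop rule reduces to one comparison per video frame: light is delivered while the mouse is on the paired side and stops when it crosses to the other side. The sketch below is illustrative only; the side labels and function names are assumptions, and real-time tracking and laser control are not modelled.

```python
def laser_commands(frame_sides, paired_side):
    """Per-frame laser state: True while the mouse is on the paired side."""
    return [side == paired_side for side in frame_sides]


def count_entries(commands):
    """Number of off-to-on transitions, i.e. entries into the paired side."""
    entries = 0
    previous = False
    for on in commands:
        if on and not previous:
            entries += 1
        previous = on
    return entries


# Hypothetical sequence of tracked sides, one label per frame.
track = ["left", "left", "right", "right", "left", "right"]
commands = laser_commands(track, paired_side="right")
print(commands)
print(count_entries(commands))
```

Note that a session starting on the paired side is counted here as one entry.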

Cocaine place preference. Conditioning for AGRP^ChR2 or AGRP^eGFP optoge- 
netic neuron photostimulation was performed in combination with a cocaine 
injection. Three passive conditioning sessions involved separate exposure to each 
side of the apparatus (1,800 s each) where the initially less preferred side was 
paired with intracranial light pulses and i.p. cocaine administration (10 mg 
per kg), and the initially preferred side was paired with saline injection and no 
photostimulation. 

Post hoc assessment of conditioned appetite. After conditioning avoidance of 
the side associated with AGRP neuron activation, AGRP^ChR2 mice were subse- 
quently tested for the possibility that AGRP neuron activation might have con- 
ditioned elevated appetite in the context associated with prior AGRP neuron 
activation. Ad libitum fed mice in the light period (09:00 to 12:00) were restricted 
to the side of the chamber previously associated with photostimulation, in which 
they were given free access to food for 1 h. In a subsequent session performed on a 
different day, the mouse was confined to the other side of the chamber and also 
given free access to food for 1 h. The side order was counterbalanced, and expo- 
sure to food on both sides was in the absence of photostimulation. 
Instrumental conditioning. Training and tests were conducted in operant con- 
ditioning chambers (Coulbourn Instruments) housed in sound isolation cham- 
bers as previously described”’. 

Negative reinforcement. Male AGRP^ChR2 mice implanted with a ferrule-capped 
optical fibre over the ARC were used if they consumed at least 0.7 g of food (20 mg 
grain pellets, TestDiet) during photostimulation as a test for proper fibre place- 
ment. We adapted a negative reinforcement instrumental conditioning protocol 
that was previously established for avoidance of optogenetic lateral habenula 
photostimulation**. Conditioning chambers had two nose poke ports, both with 
backlighting: one active port to stop AGRP neuron photostimulation and one 
inactive. AGRP^ChR2 mice were photostimulated and each nose poke resulted in a 
20 s pause for light pulses, and a tone and houselight cue were turned on until the 
laser stimulation returned (session time: 20 min). 
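
The contingency in this task (each nose poke pauses the light pulses for 20 s within a 20 min session) can be sketched as a small calculation of total paused time. The text does not say what happens when a poke occurs during an ongoing pause; the code assumes such pokes have no effect. The function name and poke times are hypothetical.

```python
def paused_seconds(poke_times_s, pause_s=20.0, session_s=1200.0):
    """Total time (s) with light pulses paused during one session."""
    total = 0.0
    pause_end = 0.0
    for t in sorted(poke_times_s):
        if t < 0 or t >= session_s:
            continue  # poke falls outside the session
        if t < pause_end:
            continue  # assumption: pokes during a pause have no effect
        pause_end = min(t + pause_s, session_s)  # clip at session end
        total += pause_end - t
    return total


# Hypothetical nose-poke times (s) in a 20 min session.
pokes = [10.0, 15.0, 40.0, 1190.0]
print(paused_seconds(pokes))
```

If pokes during a pause instead extended it, the intervals would need to be merged rather than skipped.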

A different set of male ad libitum fed AGRP^ChR2 mice were first trained to 
perform a lever pressing task on a fixed ratio schedule (FR1) in order to obtain 
food. Training occurred overnight, and a minimum of 50 lever presses was the 
inclusion criterion. The protocol above was applied where photostimulation 
pause was contingent on lever pressing. 

Instrumental conditioning for AGRP neuron inhibition. AGRP^Arch mice (Agrp- 
IRES-Cre;Ai35d) were food restricted to 85-90% of their body weight. Testing 
was in sound isolation boxes with operant chambers containing one backlit nose 
poke port. Each nose poke resulted in 60 s of light (561 nm) delivery at a power of 
15 mW delivered from fibre tip, in conjunction with onset of tone and houselight 
cues (session length, 1 h). AGRP neurons were estimated to receive irradiance of 
>15 mW per mm^2 (http://www.openoptogenetics.org). 
Progressive ratio 7 food reinforcement schedule. Male AGRP^ChR2 mice 
implanted with a ferrule-capped optical fibre over the ARC were used for instru- 
mental conditioning in a lever-press task for food. Mice were used that consumed 
at least 0.7 g of food during a 1 h photostimulation test. Mice that did not reach 
this criterion were used for the control (food-restricted) group. 
Experiments were conducted in operant conditioning chambers with two 
retractable levers, one active (delivered food pellets) and one inactive (did not 
deliver food pellets), at either side of the food hopper, and chambers were housed 
in sound isolation boxes. Levers were extended at the start of each session. After 
reaching lever-press criteria for a reward, levers were retracted and a food pellet 
was delivered. Five seconds after pellet removal from the food hopper, levers were 
extended again. 

Training. Food rewards during training consisted of grain pellets (20 mg) with 
identical composition to homecage food. Lever-press training was conducted 
under food restriction, where mice were maintained at ~85% body weight. All 
mice were trained to lever press for food with a FR1 reinforcement schedule 
overnight for one night. Mice were trained on daily, 30 min FR1 sessions until 
reaching learning criteria (earning 18 pellets in a 30 min session for 3 consecutive 
days). They were then trained with 2 h sessions for two days on a progressive ratio 
schedule where the required number of presses for each subsequent reward 
increases by 3 (PR3). Mice were then trained on a PR7 schedule for one day in 
a 2 h session. 
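
A progressive ratio schedule can be made concrete by listing the presses required for each successive reward. The text states only that the requirement increases by 3 (PR3) or 7 (PR7) per reward; the starting requirement of one press used below is an assumption, and the function names are hypothetical.

```python
def pr_requirements(step, n_rewards, first=1):
    """Presses required for each of the first n_rewards rewards."""
    return [first + step * i for i in range(n_rewards)]


def rewards_earned(total_presses, step, first=1):
    """Return (rewards earned, last completed ratio) for a press count."""
    earned = 0
    need = first
    remaining = total_presses
    while remaining >= need:
        remaining -= need
        earned += 1
        need += step
    last_ratio = need - step if earned else 0
    return earned, last_ratio


print(pr_requirements(3, 4))   # PR3
print(pr_requirements(7, 3))   # PR7
print(rewards_earned(12, 3))
```

The last completed ratio is one common way to summarize how hard an animal is willing to work for a reward on such a schedule.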

Progressive ratio 7 reinforcement. Following training, mice were tested on a PR7 
schedule for 15 consecutive days. PR7 test sessions were 2 h; however, mice were only 
allowed to lever press for food for the first 40 min of each session. At the start of 
PR7 testing, food rewards were switched to 20 mg grain pellets with 1% saccharin 
and grape flavouring (TestDiet) to allow comparison of reinforcing effects of food 
consumption outside of the testing session (see below). All mice received expo- 
sure to and consumed these grape flavoured pellets in their homecages (50 pellets 
available) the night before the start of tests to limit neophobia. 

Food-restricted group: Mice in the food-restricted group were tested while 
tethered to a dummy fibre to ensure that this tether does not interfere with lever 
pressing activity or with food consumption. 

AGRP neuron stimulation groups: For the AGRP neuron stimulation groups, 
mice were returned to ad libitum food intake for 2 days after training, before 
initiating testing. Mice were maintained under well-fed conditions. During PR7 
test sessions, one group of mice received photostimulation for the whole length of 
the session (2 h). Within this group, some mice were provided regular chow in 
their home cages, while others were maintained on the same food used as rewards 
during the test session for the duration of the experiment. These two subgroups 
were ultimately combined for statistical analysis because no difference in lever 
pressing was observed between them. A second group received photostimulation 
only during the first 40 min of the session, when the mice were allowed to press for 
food. Mice in both photostimulation groups were also tested for lever pressing in 
the absence of photostimulation after the 15 day test sessions. 

Food restricted with AGRP neuron stimulation group: After training, a group of 
food-restricted mice was tested with AGRP neuron photostimulation during a 
PR7 food reinforcement schedule with photostimulation for the whole length of 
the session (2 h). Mice were then returned to ad libitum food for 2 days, and 
retested on a PR7 schedule under well-fed conditions without photostimulation. 

No stimulation group: After training, mice in the no stimulation group were 
returned to ad libitum food intake for 2 days, before initiating testing. All mice 
were maintained under well-fed conditions without photostimulation for testing 
on a PR7 schedule for 16 consecutive days. Mice were tethered to a dummy fibre 
during testing. 

Mice that did not earn at least 5 food rewards on the first day of the PR7 test were 
removed from the experiment (one mouse from the food-restricted group). 
Breakpoint was defined as the last ratio completed before 5 min passed without 
earning a reward. For rate of lever pressing analysis, lever presses were divided 
into two blocks: first 10 min (low-effort work requirement) and rest of session 
(high-effort work requirement). The first 10 min were chosen as low-effort work 
requirement conditions as average breakpoint time was greater than 10 min for all 
groups. 
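
The breakpoint and the two-block press rate defined above can be computed as follows. This is a hedged sketch, not the authors' analysis code: it assumes event times in seconds from session start, that the 5 min timer also runs from session start before the first reward, and that "rest of session" means the remainder of the 40 min lever-access period. The function names are hypothetical.

```python
def breakpoint_ratio(reward_times_s, ratios, timeout_s=300.0):
    """Return the last ratio completed before `timeout_s` passed without a reward.

    reward_times_s: sorted times (s from session start) at which rewards were earned.
    ratios: press requirement of each reward, in the same order.
    Returns 0 if the timeout elapsed before any reward was earned.
    """
    last_ratio = 0
    previous_time = 0.0
    for t, ratio in zip(reward_times_s, ratios):
        if t - previous_time > timeout_s:
            break  # more than 5 min without a reward: later rewards do not count
        last_ratio = ratio
        previous_time = t
    return last_ratio


def press_rates(press_times_s, split_s=600.0, session_s=2400.0):
    """Presses per minute in the first 10 min (low effort) and the rest (high effort)."""
    early = sum(1 for t in press_times_s if t < split_s)
    late = sum(1 for t in press_times_s if split_s <= t < session_s)
    return early / (split_s / 60.0), late / ((session_s - split_s) / 60.0)


# Example: the gap between the 4th and 5th rewards is 450 s, so the breakpoint is 22.
bp = breakpoint_ratio([10, 60, 200, 450, 900, 1000], [1, 8, 15, 22, 29, 36])
low_rate, high_rate = press_rates([5, 30, 100, 650, 700, 2000])
```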
Photostimulation-induced weight gain. We noted an increase in body weight 
during the multi-session AGRP neuron stimulation protocol (Extended Data Fig. 
5a), probably due to a long-acting effect of released AGRP following photosti- 
mulation (this is not responsible for the acute food consumption under investigation 
here, which is due to the release of neuropeptide Y and GABA). To examine 
whether suppression of lever pressing for food was due to these long-term meta- 
bolic changes, we performed control experiments with photostimulation-induced 
body weight gain separately from interference with negative reinforcement. 

AGRP^ChR2 mice were trained to lever press under food restriction as described 
above. After training, mice were returned to ad libitum food for 2 days before 
testing began. Food rewards were switched to 20 mg grain pellets with 1% sac- 
charin and grape flavouring (TestDiet) during PR7 testing, mirroring the pro- 
tocol used in the PR7 reinforcement assay above, and only the differences are 
described. Mice were then tested under PR7 food reinforcement during photo- 
stimulation of AGRP neurons. Next, mice underwent the weight gain induction 
period. Weight gain was induced in one group of mice through daily, 2 h photo- 
stimulation sessions (22 days) until weight gain matched that of the 2 h photo- 
stimulation group in the PR7 test (~28%). A second group did not receive 
photostimulation, but were tethered to an optic fibre and served as controls for 
natural weight gain and any potential decline in lever pressing due to the time 
elapsed between tests. During these sessions, mice did not lever press for food. 
Mice in both groups were allowed to consume 20 mg grain pellets with 1% 
saccharin and grape flavouring with the number of rewards each day matched 
to the average number of rewards on the corresponding session from the 2 h 
photostimulation group in the PR7 test. After ~28% weight gain in the photo- 
stimulated group, well-fed mice were then tested again on a PR7 task during 
AGRP neuron stimulation. 

Repeated daily AGRP neuron-evoked free feeding assay. To assess the conse- 
quences of repeated daily AGRP neuron photostimulation sessions on ad libitum 
food intake, mice were tested as in the PR7 experiment, but pellets were freely 
delivered without levers present. 

GRIN lens implantation and baseplate fixation. Mice expressing GCaMP6f or 
GCaMP6s in AGRP neurons were anaesthetized using isoflurane, and a rectangular 
craniotomy (2-3 mm) was made around viral injection coordinates (bregma, 
−1.46 mm; midline, 0.3 mm). A customized sharp optical fibre (diameter: 0.6 
mm) was inserted into the brain to ~250 µm above the ARC. After retraction of the 
fibre, a gradient index (GRIN) lens (Part ID: GLP-0584; diameter: 0.5 mm, length: 
8.2 mm; Inscopix) with a custom GRIN lens-holder was slowly (150 µm per min) 
implanted. The target depth was determined by observing fluorescent signal 
through a miniature microscope (nVista HD and HD v2, Inscopix). The GRIN 
lens was fixed with black dental cement (Lang Dental Manufacturing); then a 
head bar was fixed with dental cement. A layer of parafilm covered the top 
end of the lens. A silicone adhesive (Kwik-Sil; World Precision Instruments) was 
applied above the parafilm to protect the lens. 

Two to four weeks after GRIN lens implantation, awake mice were head-fixed 
by a head bar holder. A baseplate (Part ID: BPL-1 and Part ID: BPL-2; Inscopix) 
attached to the miniature microscope was positioned above the GRIN lens. The 
focal plane was adjusted until neuronal structures and GCaMP6 dynamic res- 
ponses were clearly observed. Then mice were anaesthetized by isoflurane and the 
baseplate was fixed with dental cement. 
Calcium imaging in freely moving mice. Mice were habituated to head-fixation, 
and the microscope was connected to the baseplate while the animal was head- 
fixed, followed by 30 min acclimatization before imaging sessions. Fluorescence 
images were acquired at 10 Hz and the LED power was set to 10-35% (0.1-0.35 
mW) with analogue gain 3-4. To compare the Ca²⁺ activity in different test 
sessions, the image acquisition parameters were set to the same values. Animal 
behaviour was recorded by a top mounted camera (Basler) (30 Hz). A synchron- 
ization signal between the miniature microscope and camera was recorded by a 
signal acquisition system (Neuralynx). 

Ghrelin (1 µg per g) and saline injections (i.p.) were performed on ad libitum 
fed mice. Food-restricted mice (80-85% initial body weight) underwent chow 
food, false food, and Pavlovian trace conditioning tests. Chow food was identical to 
that in the home cage (see Mice section of Methods). False food was a similar sized 
wood block/foam plug. For feeding experiments, either object was placed into the 
test arena 1.5 min after the onset of imaging sessions. For short exposure tests, the 
object was removed 1 min after delivery. Events of delivery, contact, leaving, and 
removal were manually marked from the behaviour video. During the Pavlovian 
trace conditioning test, a 200 ms compound conditioned stimulus, auditory (12 kHz) 
and visual (blue light), was presented at a random time 60-90 s after the onset 
of imaging sessions. A lickometer spout was extended 800 ms after the cue with 30 
s of access to the spout, which delivered a palatable liquid food (Ensure). Lick events 
were recorded by the Neuralynx system. 
Calcium image analysis. All image analyses were performed in ImageJ and 
Matlab. Because AGRP neurons are at the base of the brain and next to the third 
ventricle, the mechanical drift between GRIN lens and brain tissue included some 
nonlinear distortions. Movement was corrected on the Janelia computer cluster by a 
custom Matlab script using an open source toolkit, ANTs (http://picsl.upenn.edu/ 
software/ants/). The movement-corrected images were cropped to remove the 
margin values filled by the registration. This movement correction algorithm 
allowed image analysis even when mice were chewing chow food pellets, typically 
a difficult case for brain imaging (for example, Fig. 4f does not show movement 
artefacts associated with initiation or cessation of chewing). As an example that 
calcium activity reduction observed during eating is not due to movement artefacts, 
Fig. 4f shows a neuron (neuron 1) that increases activity and is surrounded by other 
neurons (3, 10, 11, 12, see Fig. 4d) that reduce activity, indicating that the response 
properties are due to specific neuron dynamics and not a generalized movement 
artefact, which would affect the entire local area of the neurons. 


To extract calcium indicator fluorescence responses associated with individual 
neurons, a cell-sorting algorithm based on principal component analysis and inde- 
pendent component analysis was used to automatically compute ROI spatial filters 
that were applied to the aligned imaging data. Separate spatial filters for each cell 
consisted of a weight-matrix with values between zero and one that was used to 
compute the fractional contribution of each pixel to the calculation of calcium 
fluorescence. Background fluorescence was subtracted from cropped images using 
the ImageJ background subtraction function. Ca²⁺ activity of individual cells within ROI 
spatial filters was extracted from the background-subtracted images. ΔF/F₀ was 
calculated as (F − F₀)/F₀, where F₀ is the lowest 5% of the fluorescence signal in image 
sessions on one day. Normalized ΔF/F₀ was used to transform the range of ΔF/F₀ to 
[0, 1] by the equation: (ΔF/F₀ − min(ΔF/F₀))/(max(ΔF/F₀) − min(ΔF/F₀)). 
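
The two normalization steps above can be written out as a short sketch. It is illustrative only and makes one explicit assumption: the text says that F₀ "is the lowest 5% of the fluorescence signal", which is read here as the mean of the lowest 5% of samples in the trace. The function names are hypothetical, and the original analysis was carried out in Matlab rather than Python.

```python
def delta_f_over_f0(trace, fraction=0.05):
    """Compute dF/F0 = (F - F0) / F0 for one cell.

    Assumption: F0 is the mean of the lowest `fraction` of fluorescence samples.
    """
    ordered = sorted(trace)
    n_low = max(1, int(len(ordered) * fraction))  # use at least one sample
    f0 = sum(ordered[:n_low]) / n_low
    return [(f - f0) / f0 for f in trace]


def normalize_unit_range(values):
    """Rescale values to [0, 1]: (x - min) / (max - min)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # flat trace: avoid division by zero
    return [(v - lo) / (hi - lo) for v in values]


trace = [100.0, 110.0, 150.0, 200.0, 120.0]
dff = delta_f_over_f0(trace)       # F0 is 100.0 for this short trace
norm = normalize_unit_range(dff)   # smallest value maps to 0, largest to 1
```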

ROI spatial filters identified from the same field of view were often different 
under different test sessions. To compare Ca²⁺ activity in such situations, only the 
intersection of the ROI spatial filters was used for further analyses and com- 
parisons. Under ad libitum fed conditions, Ca²⁺ activity is quite low, and only a few 
ROIs can be detected by the cell-sorting algorithm. To compare AGRP Ca²⁺ 
activity between ad libitum fed and other conditions, the ROIs detected under 
other test conditions with higher GCaMP6 fluorescence were manually mapped 
to images from the ad libitum fed condition. 

To test whether Ca²⁺ activity of individual neurons was changed after ghrelin 
injection or chow food delivery, 30% of initial mean baseline fluorescence was chosen 
as a threshold. Neurons with mean fluorescence changes after the experimental 
manipulation that were more than the threshold, either decreasing or increasing 
fluorescence, were categorised as changed. For ghrelin administration, the mean 
fluorescence (from 1 min recording) after ghrelin injection (10 min post-injection) 
was compared to 1 min mean pre-injection GCaMP6 fluorescence. For food delivery, 
the mean fluorescence (from 1 min recording) from immediately after food delivery 
was compared to 1 min mean pre-food GCaMP6 fluorescence. 
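
The 30% threshold rule above amounts to comparing two 1 min window means per neuron. The sketch below is a minimal illustration under stated assumptions: traces are sampled at the 10 Hz acquisition rate given earlier, the change is expressed as a fraction of the pre-manipulation mean, and the function names and labels are hypothetical.

```python
def window_mean(trace, start_s, duration_s=60.0, rate_hz=10.0):
    """Mean fluorescence over `duration_s` seconds starting at `start_s`."""
    first = int(round(start_s * rate_hz))
    last = first + int(round(duration_s * rate_hz))
    window = trace[first:last]
    if not window:
        raise ValueError("window lies outside the trace")
    return sum(window) / len(window)


def classify_response(pre_mean, post_mean, threshold_fraction=0.30):
    """Label a neuron by its change in mean fluorescence relative to baseline."""
    change = (post_mean - pre_mean) / pre_mean
    if change > threshold_fraction:
        return "increased"
    if change < -threshold_fraction:
        return "decreased"
    return "unchanged"


# Synthetic example: 1 min of baseline followed by 1 min at 1.5x baseline.
trace = [100.0] * 600 + [150.0] * 600
pre = window_mean(trace, start_s=0.0)
post = window_mean(trace, start_s=60.0)
label = classify_response(pre, post)
```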
Evoked water consumption with SFO neuron activation. For cell type-specific 
evoked water consumption, Nos1 was identified as a marker for SFO neurons by 
inspection of the Allen Brain Atlas. SFO^hM3Dq and SFO^NOS1-ChR2 mice were 
housed with ad libitum food and water and were transferred into a behavioural 
arena (Coulbourn Instruments) with food pellets (20 mg each) delivered through 
an automatic pellet dispenser as previously described. Water was supplied 
through a ball bearing-gated metal spout that was fed by a water bottle. Licking 
was detected by beam breaks and calibration experiments showed lick volume 
was 1.35 ± 0.21 ml (n = 2 mice, mean ± s.d.). 

All evoked drinking tests were performed during the early light period. Water 
intake was recorded for 1 h before the onset of neuron activation to establish a 
baseline drinking rate. This was followed by a chemogenetic or optogenetic SFO 
neuron stimulation period. For SFO^hM3Dq mice, baseline water consumption was 
measured for 1 h followed by clozapine-N-oxide (Enzo Life Sciences) injection 
(i.p., 2.5-5 mg per kg). Unless otherwise noted, mice had free access to food and 
consumption was also measured. 
Progressive ratio 3 reinforcement schedule for water. SFO^hM3Dq mice that 
exhibited elevated water consumption in response to CNO (2.5-5 mg per kg, 
i.p.) were used for instrumental lever-press training (mice that did not show 
elevated water consumption were determined, post hoc, to lack hM3Dq- 
mCherry expression in the SFO). Training and tests were conducted in operant 
conditioning chambers (Coulbourn) with two levers, one active and one inactive, 
at either side of the water dispenser. A pinch valve faucet controlled opening of a 
syringe tube from which the water was dispensed. For each trial, after reaching 
lever-press criteria on the active lever, a water reward (4-6 µl) was delivered by 
releasing the pinch valve (NResearch) and was controlled by Graphic State 
Software (Coulbourn Instruments). Lever pressing on the inactive lever was 
monitored but did not lead to water delivery. 

For training, SFO^hM3Dq mice were water restricted (1 ml per day) and trained in 
30 min daily FR1 sessions until they performed at least 250 lever presses in a 
session for 3 consecutive days. They were then trained on a progressive ratio 3 
(PR3) reinforcement schedule (each water delivery reinforcer required 3 more 
lever presses than the previous reinforcer) in 1 h sessions for three days. 
Following training, mice were rehydrated by ad libitum water access for 1 week. 
Mice were then tested on a PR3 reinforcement schedule in a 1 h session following 
CNO (2.5-5 mg per kg) or saline injection. Breakpoint was defined as the last ratio 
completed before 5 min had passed without earning a water reinforcer. 
Statistics. Values are means ± s.e.m. Pairwise comparisons were calculated by 
unpaired or paired two-tailed Student's t-test or ANOVA. When equal variance 
assumptions were violated, nonparametric ANOVA on ranks test was used 
(see Extended Data Table 1). Post hoc multiple comparisons used Holm-Sidak 
correction. Statistical analyses and linear regressions were performed using 
SigmaPlot (Systat) or Matlab. Sample sizes were chosen to cover high and low 
viral transduction levels. Viral transduction efficiency for mice in electrical 
activity perturbation experiments (Figs 1 and 2) was determined post hoc, which 
effectively blinded experimenters to the group identity (high or low transduction 
efficiency) for each subject. Results of statistical tests are summarized in Extended 
Data Table 1. NS, P > 0.05, *P < 0.05, **P < 0.01, ***P < 0.001. No statistical 
methods were used to predetermine sample size. 
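
For readers unfamiliar with the Holm-Sidak correction named above, the adjustment can be sketched as below. This is an illustrative pure-Python version of the standard step-down procedure, not the SigmaPlot or Matlab routine used for the analyses; the function name and the example P values are hypothetical.

```python
def holm_sidak(p_values):
    """Holm-Sidak step-down adjusted P values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # Sidak adjustment over the (m - rank) hypotheses not yet tested.
        adj = 1.0 - (1.0 - p_values[idx]) ** (m - rank)
        running_max = max(running_max, adj)  # keep adjusted values non-decreasing
        adjusted[idx] = min(1.0, running_max)
    return adjusted


# Hypothetical raw P values from three post hoc comparisons.
p_adj = holm_sidak([0.01, 0.04, 0.03])
significant = [p < 0.05 for p in p_adj]
```

With these example inputs only the smallest raw P value remains below 0.05 after adjustment, which illustrates why a comparison that is nominally significant on its own can be reported as NS.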
Extended Data Figure 1 | Models for homeostatic regulation of learning 
food preferences and food-seeking behaviours. a, The relationship between 
internal or external cues and Pavlovian approach or instrumental food-seeking 
actions is strengthened by nutrient ingestion. Nutrients have intrinsically 
positive valence (rewarding), and energy deficit enhances the reward value of 
outcomes associated with food intake. b, Model of food preference and food- 
seeking in which learning involves reducing an energy deficit internal state that 
has negative valence. The relationship between internal or external cues and 
food preferences or food-seeking actions is strengthened by nutrient ingestion 
outcomes that reduce energy deficit and associated negative valence (red bar 
arrows are inhibitory). Conversely, the relationship between internal or 
external cues and food preference or food-seeking actions is weakened if 
outcomes do not reduce energy deficit. 
Extended Data Figure 2 | AGRP neuron activation does not condition taste 
aversion, and feeding reduction correlates with proportion of AGRP 
neurons inhibited. a, Experimental design for conditioned taste aversion 
experiments. Mice were water restricted and habituated to drink water from a 
spout during 20 min sessions. Four groups of mice were then allowed to 
consume a tastant (0.15% saccharin solution) for 20 min (pre-test) and 
immediately following this session, they were exposed to a conditioning agent 
(LiCl, saline, 120 min AGRP neuron photostimulation, or AGRP^ChR2 mice 
attached to an optical fibre but not photostimulated; all n = 6 mice). The next 
day, mice were tested for consumption of the saccharin solution (test 1). For 
AGRP neuron photostimulated and non-photostimulated groups, 
conditioning and testing was extended with an additional three conditioning 
and test sessions. The day following the last testing session for each group, water 
consumption was also measured (water test). b, c, Consumption of tastant 
solution for all sessions (b) and comparison for pre-test and test 1 session 
(c). d, Confocal micrographs of Cre recombinase-expressing AGRP neurons 
transduced with rAAV-Syn-FLEX-PSAM^L141F-GlyR-IRES-eGFP. Alexa555- 
conjugated-Bungarotoxin (Bgt-555) labels PSAM^L141F-GlyR, which co- 
localizes with eGFP. Scale bar, 100 µm. e, f, Fos immunofluorescence in the 
ARC of mice treated with PSEM^89S during the first 4 h of the dark period 
without access to food. AGRP^ChR2 mice (e) show high levels of Fos in AGRP 
neurons, and AGRP^PSAM-GlyR mice (f) express low levels of Fos in neurons that 
express PSAM-GlyR (right side); non-transduced neurons (contralateral side) 
express high levels of Fos. Scale bar, 100 µm. g, Fos immunofluorescence 
intensity in AGRP neurons from AGRP^PSAM-GlyR or AGRP^ChR2 mice after 
PSEM^89S treatment during the first 4 h of the dark period without access to food 
(n = 3 mice per condition, n > 50 nuclei per condition). h, Change in food 
intake for AGRP^ChR2 mice (n = 12) or AGRP^PSAM-GlyR mice (n = 23) treated 
with PSEM^89S during the first 4 h of the dark period relative to saline injected on 
successive days. i, Diagram of AGRP neuron axon projection fields showing 
from where transduction efficiency was calculated. j-m, After rAAV-hSyn- 
FLEX-rev-PSAM^L141F-GlyR-IRES-eGFP transduction of Agrp-IRES-Cre mice, 
measurement of eGFP transduction efficiency in AGRP boutons in the PVH 
(j, k) and PAG (l, m). High transduction efficiency (>50% in AGRP boutons) is 
shown (j, l) in comparison to low transduction efficiency (<50% in AGRP 
boutons) (k, m). Scale bar, 20 µm. n, Food intake reduction for mice treated 
with PSEM^89S is correlated with the transduction efficiency of rAAV-hSyn- 
FLEX-rev-PSAM^L141F-GlyR-IRES-eGFP in AGRP neurons (eGFP transduced 
boutons per total AGRP boutons) (n = 35 mice). NS, P > 0.05, ***P < 0.001. 
Values are means ± s.e.m. 
Extended Data Figure 3 | AGRP neuron activation does not condition 
appetite or reinforce instrumental responding. a, Experimental design to test 
conditioned appetite. After closed-loop place preference and extinction testing 
(Fig. 2), AGRP^ChR2 mice showed reduced occupancy in the photostimulation- 
paired side of the chamber. Avoidance in extinction indicated conditioning to 
offset of a negative-valence signal from AGRP neurons. An alternative 
hypothesis is that induction of food-seeking on the photostimulation side in the 
absence of food led the mouse to seek food. Because photostimulation was 
stopped when the mouse passed to the other side of the chamber, this might 
increase occupancy on the non-photostimulated side. However, this is not 
consistent with the increased avoidance of the previously photostimulated side 
in extinction (Fig. 2k) unless the contextual cues previously associated with 
photostimulation conditioned increased appetite. To test whether conditioned 
avoidance might be associated with conditioned hunger, we measured food 
intake in ad libitum fed mice after closed-loop place preference and extinction 
tests in Fig. 2g—k on each side of the apparatus in the absence of 
photostimulation. b, Mice did not show conditioned food consumption on the 
previously photostimulated side (paired t-test, n = 8 mice). This indicates that 
avoidance observed in extinction was not a consequence of food-seeking 
behaviours being differentially engaged on one side of the apparatus. 
c, d, Cessation of AGRP neuron photostimulation did not condition 
instrumental responding. c, Nose pokes by ad libitum fed AGRP^ChR2 mice 
(n = 9) during photostimulation, where a nose poke resulted in a 20 s pause in 
light pulses for each behavioural session. Nose pokes reduced across sessions 
indicating the absence of instrumental conditioning. Filled circles: active port, 
empty circles: inactive port. d, For ad libitum fed AGRP^ChR2 mice previously 
trained to hit a lever for food, lever presses during photostimulation, where a 
lever press gives a 20 s pause in light pulses for each behavioural session 
(repeated measures ANOVA F(7,49) = 1.19, P = 0.330; n = 8 mice). 
e-h, Optogenetic silencing with Arch (550-600 nm, 8-11 mW per mm²). 
e, Cell-attached recording of AGRP neuron firing rate in brain slices from Agrp- 
IRES-Cre;Ai35d (AGRP^Arch) mice during light illumination. f, Whole cell 
recording of an AGRP^Arch neuron during optogenetic inhibition. g, Membrane potential 
change in AGRP neurons expressing Arch during light illumination (n = 6). 
h, AGRP neuron firing rate during optogenetic inhibition of Arch-expressing 
AGRP neurons (n = 4). i, Optogenetic silencing of AGRP neurons in food- 
restricted mice did not condition free operant instrumental responding. Nose 
pokes by AGRP^Arch mice resulted in 60 s of 561 nm light delivered to an optical 
fibre over the ARC. Nose poking reduced over multiple sessions (ANOVA 
F(3,24) = 7.835, P < 0.001; n = 7 mice), indicating that silencing AGRP neurons 
did not directly reinforce instrumental responding. NS, P > 0.05. Values are 
means ± s.e.m. 
Extended Data Figure 4 | Lever pressing for food is sensitive to AGRP 
neuron photostimulation duration. a, Experimental design of progressive 
ratio 7 lever-press experiment from Fig. 3 for a food-restricted AGRP neuron 
photostimulated group and an ad libitum fed non-photostimulated group. The 
two additional groups of mice were trained to lever press under food restriction on 
a PR7 reinforcement schedule. For the food-restricted with photostimulation 
group, mice were maintained under food restriction and tested with PR7 
reinforcement tests over 15 sessions with photostimulation. Each session was 
2 h, where levers were available for the first 40 min of the session, and 
photostimulation was delivered for the length of the session (120 min, grey). 
Mice were then ad libitum re-fed and tested on a non-photostimulated PR7 
session. For the ad libitum fed non-photostimulated group, mice were ad 
libitum re-fed following lever-press training and tested with PR7 reinforcement 
tests over 16 sessions, with no photostimulation delivered (beige). b, Lever 
presses for each PR7 session for food-restricted AGRP neuron photostimulated 
mice (grey, n = 11) and ad libitum fed non-photostimulated mice (beige, 
n = 8). For comparison, data are shown for food-restricted and 120 min 
photostimulated groups that are reproduced from Fig. 3b. c, Lever presses on 
first (1) and last (15) sessions in PR7 test for food-restricted with 
photostimulation mice (grey) and sated no photostimulation mice (beige). 
Also shown are data for food-restricted and 120 min photostimulated groups 
that are reproduced from Fig. 3c. d, Experimental design of progressive ratio 7 
lever-press experiment from Fig. 3 for a 40 min photostimulation group. One 
additional group of mice was trained to lever press under food restriction on a PR7 
reinforcement schedule. Mice were then ad libitum re-fed and tested with PR7 
reinforcement tests over 15 sessions. Each session was 2 h, where levers were 
available for the first 40 min of the session, and photostimulation was 
delivered only while levers were available (grey). A non-photostimulated 
PR7 session was also performed after the fifteenth test session. e, Lever 
presses for each PR7 session for 40 min photostimulated (grey, n = 12) 
mice. Also shown are data for food-restricted and 120 min photostimulated 
groups that are reproduced from Fig. 3b. f, Lever presses on first (1) and last 
(15) sessions in PR7 test for 40 min photostimulated mice (grey). Also 
shown are data for food-restricted and 120 min photostimulated groups 
that are reproduced from Fig. 3c. NS, P > 0.05, **P < 0.01, ***P < 0.001. 
Values are means ± s.e.m. 
Extended Data Figure 5 | AGRP neuron-associated body weight increase 
does not suppress AGRP neuron-evoked food-seeking. a, Weight gain for the 
120 min AGRP neuron photostimulated (n = 11) group in the PR7 experiment 
(from Fig. 3) after 15 sessions. Weight gain is due to eating after the test session 
when the mouse is returned to the homecage and is associated with long-lasting 
effects from release of AGRP. Previous experiments have shown that AGRP is 
not responsible for the acute feeding behaviour investigated in this study. 
However, metabolic changes associated with weight gain could be an alternative 
cause of reduced instrumental food-seeking shown in Fig. 3. To test the effect of 
weight gain in mice trained to lever press for food on a PR7 reinforcement 
schedule, we induced weight gain without the negative reinforcement 
extinction protocol from Fig. 3. b, Experimental design of progressive ratio 7 
lever-press experiment with AGRP neuron photostimulation-induced weight 
gain but lacking disruption of negative reinforcement during food-seeking. 
AGRP^ChR2 mice were trained under food deprivation to lever press under a PR7 
schedule for food pellets. After training, the mice were ad libitum re-fed and 
divided into two groups: (1) control mice with no induction of 
weight gain (blue) and (2) the induced weight gain group (red). Both groups 
were then tested on a PR7 reinforcement schedule under AGRP neuron 
photostimulation conditions (PR7 test 1). Following this session, a 
photostimulation-induced weight gain protocol was initiated for the second 
group. Mice received one 2 h experimental session per day, where they were 
photostimulated for the whole experimental session and body weight was 
monitored daily. During these sessions, levers were not available, but free food 
was provided (the amount of food was matched in 
quantity to the average amount of food acquired by the 120 min 
photostimulation group under the PR7 experiment from Fig. 3 for the 
corresponding session). The photostimulation-induced weight gain protocol 
was conducted for 22 consecutive days, which was required for body weight 
gain to be comparable to levels acquired by the 120 min AGRP neuron 
photostimulation group in the PR7 experiment (~28%) from Fig. 3. Control 
mice were tethered to a fibre but did not receive photostimulation, otherwise 
they received the same experimental manipulation as induced weight gain mice 
(access to the same amount of food), and their body weight was also monitored. 
After the induced weight gain group achieved a 28% weight gain, a second PR7 
test was conducted for both groups in the same manner as the first one. c, Per 
cent body weight change for control (blue, n = 6) and induced weight gain (red, 
n = 6) mice. Grey dotted line: per cent body weight change for photostimulated 
mice in PR7 experiment from Fig. 3. d, Lever presses for control (blue) and 
AGRP neuron photostimulation-induced weight gain (red) mice on first (1) 
and second (2) PR7 test, before and after the weight gain induction protocol, 
respectively. There was no significant reduction in lever pressing between PR7 
sessions within either group. NS, P > 0.05, ***P < 0.001. Values are means ± 
s.e.m. 
Extended Data Figure 6 | Free food consumption is not reduced with 
repeated daily AGRP neuron photostimulation sessions. a, Experimental 
design of free feeding experiment over repeated sessions. Three groups of 
AGRP^ChR2 mice were tested on a 15 session free feeding protocol (no lever 
pressing required) either under food restriction (black), ad libitum fed AGRP 
neuron photostimulated (cyan), or ad libitum fed without AGRP neuron 
photostimulation (grey) conditions. On each day, mice received one 2 h session 
where food was freely available for the first 40 min of the session. The AGRP neuron 
photostimulated group received photostimulation for the entire 2 h session (cyan). 
b, Food intake for each session of the free feeding experiment for food restricted 
(black, n = 6), AGRP neuron 120 min photostimulated (cyan, n = 6), and no 
photostimulation (grey, n = 6) groups. NS, P > 0.05. Values are means ± s.e.m. 
Extended Data Figure 7 | Calcium imaging of AGRP neurons in freely 
moving mice. a, Projection of confocal images of AGRP neurons from brain 
slices after mice expressed GCaMP6s for 10 months after viral injection. A total 
of >99.5% neurons show nuclear exclusion of GCaMP6s, indicating good cell 
health. Red arrow, rare example of filled nucleus. Scale bar, 15 µm. 

b, Characterization of the relationship between action potential firing rate 
(cell attached recordings) and change of GCaMP6f fluorescence activity in 
brain slices by puffing AMPA for activation (top, middle) or muscimol for 
inhibition (bottom). c, Epifluorescence images of AGRP^GCaMP6f neurons 
(left) from ad libitum fed mice after ghrelin injection by deep-brain calcium 
imaging and their ROI spatial filters (right) for image analysis. Scale bar, 
15 µm. d, For freely moving ad libitum fed mice during in vivo imaging, 
fluorescence traces of individual AGRP^GCaMP6f neurons in c before and after 
ghrelin injection (fluorescence responses separated in time by 4 min, during 
which time ghrelin was injected). e, Changes in mean Ca²⁺ activity before 
and 4 min after ghrelin/saline injections (90 neurons, 4 ad libitum fed mice). 
f, Time course of changes in mean Ca²⁺ activity after ghrelin (blue) or saline 
(red) injection (90 neurons, 4 mice). Green dashed line, exponential fit. 

g, Distribution of individual time constants for decline of ghrelin-mediated 
fluorescence increase for individual neurons showing goodness of fit >0.85 
(67/90 neurons, 4 mice). h, Baseline GCaMP6f fluorescence at the start of 
each trial before 1 min exposure to food/wood in each trial. i, GCaMP6f 
fluorescence comparing initial baseline activity, exposure to an inaccessible 
but visible food outside the cage, and subsequent consumption of food (60 
neurons, 2 mice). NS, P > 0.05, ***P < 0.001. Multiple comparisons with 
Holm’s correction. Values are means ± s.e.m. 
Extended Data Figure 8 | SFO neuron-evoked water seeking and 
consumption. a, Schematic of injection targeting of hM3Dq-mCherry to SFO 
neurons. b, Epifluorescence image of mCherry fluorescence in a coronal section 
containing the SFO (box) targeted stereotaxically by co-injection of rAAV- 
hSyn-Cre and rAAV-Efla-DIO-hM3Dq-mCherry. Scale bar, 1 mm. 

c, Confocal micrograph of SFO neurons co-transduced with rAAV-hSyn-Cre 
and rAAV-Efla-DIO-hM3Dq-mCherry. Scale bar, 100 um. d, Number of licks 
for a representative SFO"? mouse during evoked water consumption 
following activation of SFO neurons by CNO injection. e, Number of licks for 
SFO"? mice following saline or CNO injection (n = 5 mice). f, Cumulative 
lever pressing for a SFO"?4 mouse following injection of CNO (red) or saline 
(black). g, h, For SFO"™?P4 mice, lever presses (red/black: active lever, grey: 
inactive lever) (g) and breakpoint reinforcement ratio (h) on a PR-3 water 
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reinforcement schedule following either saline or CNO injection (n = 5 mice). 
i, Top, experimental design to test if activation of SFO neurons can elicit food 
consumption in the absence of water. SEO"? mice were presented with 
access to food but not water for one hour (pre), which was followed by CNO 
injection, and food intake was measured for an additional hour. Bottom, food 
intake by SFO"? mice that lack access to water before (pre) and after the 
application of CNO (paired t-test, n = 3). j, Top, experimental design to test if 
activation or offset of SFO neurons elevate food consumption behaviour. 
SFONOS!-ChR2 mice had access to food and water, and both were measured 
before (1 h, pre), during (20 Hz, 1 h), and after photostimulation (1 h, post). 
Bottom, food consumed by SFONOST-ChR2 nice before (pre), during (20 Hz), or 
after (post) photostimulation (paired t-test, n = 5). NS, P > 0.05. Values are 
means + s.e.m. 
Extended Data Table 1 | Results of statistical analysis 
Neural dynamics for landmark 
orientation and angular path integration 


Johannes D. Seelig¹ & Vivek Jayaraman¹ 


Many animals navigate using a combination of visual landmarks and path integration. In mammalian brains, head 
direction cells integrate these two streams of information by representing an animal’s heading relative to landmarks, 
yet maintaining their directional tuning in darkness based on self-motion cues. Here we use two-photon calcium 
imaging in head-fixed Drosophila melanogaster walking on a ball in a virtual reality arena to demonstrate that 
landmark-based orientation and angular path integration are combined in the population responses of neurons 
whose dendrites tile the ellipsoid body, a toroidal structure in the centre of the fly brain. The neural population 
encodes the fly’s azimuth relative to its environment, tracking visual landmarks when available and relying on 
self-motion cues in darkness. When both visual and self-motion cues are absent, a representation of the animal’s 
orientation is maintained in this network through persistent activity, a potential substrate for short-term memory. 
Several features of the population dynamics of these neurons and their circular anatomical arrangement are suggestive of 
ring attractors, network structures that have been proposed to support the function of navigational brain circuits. 


Visual landmarks can provide animals with a reliable indicator of 
their whereabouts'. In the absence of such cues, many animals track 
their position relative to a reference point by continuously monitoring 
their own motion, a process called path integration’. Estimates of posi- 
tion based purely on self-motion cues, however, can accumulate error 
over time. Successful navigation then, requires animals to flexibly com- 
bine these distinct sources of information’. In mammalian brains this 
process of integration is evident in head direction cells’, which are 
neurons sensitive to an animal’s heading relative to visual cues in its 
surroundings that maintain their representation of heading in total 
darkness using self-motion cues*. With their smaller brains and iden- 
tifiable neurons, insects offer tractable systems to examine the integ- 
rative neural computations underlying navigation’. Indeed, many 
insects (for example, desert ants and honeybees*’) are known to navi- 
gate using landmarks and path integration’. Experiments in a variety of 
insects indicate the involvement of the central complex (CX)—a brain 
region conserved across insects—in such behaviour. In the fruitfly, 
behavioural genetics experiments have suggested that the CX is required 
for several components of navigation, including memory for visual 
landmarks®, patterns’ and places'®, and directional motor control'’. 
Electrophysiological recordings in immobilized locusts’? and butter- 
flies* have revealed a map-like representation for the orientation of 
electric field vectors of polarized light, which may enable sun-compass 
navigation'*. Extracellular recordings from CX neurons in tethered 
walking cockroaches have shown encodings of turning direction’* and 
of wide-field optic flow’®, a potential cue for self-motion. However, 
previous studies of visual responses in the CX were conducted under 
conditions in which insects passively viewed visual stimuli. We sought 
to uncover integrative neural processes relevant to navigation in the 
CX by allowing a tethered fly to control and respond to visual stimuli’” 
while simultaneously recording its neural activity and behaviour. 

We used two-photon imaging with the genetically encoded calcium 
indicator GCaMP6f (ref. 18) to monitor neural responses in the CX 
while a head-fixed fly walked on an air-supported ball within a light- 
emitting-diode (LED) arena’*”° (Fig. 1a, b; see Methods). In previous 
experiments, we identified a subset of neurons with projections to the 
CX, and specifically to rings of the ellipsoid body (EB), that show 
strong tuning to localized visual features” including vertical stripes, 
a class of stimuli that also induce innate fixational responses in 
flies*’”?. To probe how such visual information might be used within 
the CX we now focused on a class of columnar neurons of the CX, each 
of which sends dendrites to a specific wedge of the EB (Fig. 1c, d). 
These neurons are abbreviated here as EBw.s neurons (see 
Methods)’*?**’. We monitored the dendritic responses of the entire 
EBw.s population in the EB (Fig. 1e) during walking, both under 
closed-loop virtual reality conditions in which the rotation of visual 
patterns was driven by the fly’s turning movement on the ball and in 
darkness (Extended Data Fig. 1). 


Compass-like representation of landmark orientation 
When flies were exposed to a single vertical stripe stimulus (Extended 
Data Fig. 1a), we observed a sector of activity, or bump, in the EB that 
rotated concurrently with the stripe as the fly turned on the ball (Fig. 1f, j, 
k; Supplementary Video 1, Extended Data Fig. 2a–c). The spatial extent 
of the visual arena (270°) was mapped to the full angular extent (360°) of 
the EB (Extended Data Fig. 2d, see Methods). A population vector aver- 
age (PVA) of EBw.s activity (Fig. 1g, h) sufficed to reliably decode the 
stripe’s azimuthal position relative to the fly, or, equivalently, the fly’s 
virtual orientation relative to the stripe (Fig. 1g–i, l; see Methods), with 
a fly-specific angular offset (Fig. 1m). Offsets occasionally changed 
between trials (for example, Fly 2 and Fly 10 in Fig. 1m and Extended 
Data Fig. 2e), but seldom within a trial (Extended Data Fig. 2f). Such 
differences in the locking of the EBw.s activity bump to visual cue 
position suggest that EBw.s population activity cannot be a static 
retinotopic representation of the animal’s surroundings”. 
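
As an illustration of the decoding step described above, a population vector average can be sketched as a fluorescence-weighted circular mean over the 16 wedges. This is a minimal sketch under stated assumptions, not the procedure given in Methods: the evenly spaced wedge-centre angles, the linear 270° to 360° scaling, and the function names `population_vector_average` and `arena_to_eb_deg` are hypothetical choices made for the example.

```python
import numpy as np

def population_vector_average(fluorescence):
    """Return (angle_deg, amplitude) for one frame of wedge activity.

    Assumption: wedges are evenly spaced around the ring and each wedge
    contributes a unit vector at its centre, weighted by its fluorescence.
    """
    f = np.asarray(fluorescence, dtype=float)
    n = f.size
    centres = (np.arange(n) + 0.5) * 2.0 * np.pi / n  # wedge centres (rad)
    vec = np.sum(f * np.exp(1j * centres))            # weighted vector sum
    total = np.sum(f)
    angle_deg = np.degrees(np.angle(vec)) % 360.0
    amplitude = np.abs(vec) / total if total > 0 else 0.0
    return angle_deg, amplitude

def arena_to_eb_deg(arena_deg, arena_span_deg=270.0):
    """Assumed linear mapping of arena azimuth onto the full 360° ring."""
    return (arena_deg * 360.0 / arena_span_deg) % 360.0
```

Under these assumptions, a bump confined to a single wedge yields an amplitude near 1 and an angle at that wedge's centre, whereas uniform activity yields an amplitude near 0. The assumed linear mapping also places two stripes separated by 135° in the arena at opposite positions on the ring.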

The single-stripe EBw.s responses (Fig. 1) could result from a tun- 
ing to visual features”, or from a more abstracted representation of 
the fly’s orientation with respect to its environment. To distinguish 
between these possibilities, we asked how EBw.s population activity 
changes in a more complex visual scene with multiple features 
(Fig. 2a; Extended Data Fig. 1b). In this environment, a visual feature 
map would produce an EBw.s activity pattern of increased width and 
complexity. Instead, consistent with EBw.s activity representing the 
fly’s orientation, we observed a single bump of similar width (Fig. 2a–c, 
Supplementary Video 2, Extended Data Fig. 3a–c), the spatial extent 
of the arena was once again mapped onto the EB (Extended Data 
Fig. 3d), and the PVA estimate of the fly’s azimuth remained accurate 
(Fig. 2d–g, Extended Data Fig. 3e, f). 

All the cues used thus far provided the fly with landmarks that 
uniquely define its orientation in the environment. An activity bump 
could thus represent either the fly’s angular position within the visual 
scene or its orientation relative to a specific landmark within it. We next 
placed the fly in a visual scene with two identical vertical stripes 
positioned to map exactly opposite each other on the EB ring (Extended 
Data Fig. 1c, Extended Data Fig. 4a). Consistent with EBw.s activity 
representing orientation by flexibly locking to a single landmark, the 
resulting EBw.s representation again involved a single bump (Extended 
Data Fig. 4b–f; Supplementary Video 3), with the same mapping of the 
visual scene onto the EB as before (Extended Data Fig. 4g). PVA 
estimates were well correlated with the orientation of the fly in the scene 
(Extended Data Fig. 4h–m), regardless of which stripe was directly in 
front of the fly (Supplementary Video 3). In a few cases, however, we 
observed that EBw.s activity transitioned from one offset to another 
relative to the visual cues (Extended Data Fig. 5a–c, Supplementary 
Video 4), potentially reflecting the ambiguity inherent in determining 
landmark-guided orientation in an environment with multiple 
indistinguishable visual landmarks. Taken together, these data are consistent 
with a function for the EBw.s population as an internal compass that 
adaptively represents the fly’s orientation relative to visual landmarks. 

Figure 1 | Ellipsoid body activity tracks azimuth of visual cue. a, Schematic 
of setup. Inset, close-up of fly on air-supported ball (modified from ref. 20). 
b, Schematic of fly central brain and CX: ellipsoid body (EB), fan-shaped body 
(FB), protocerebral bridge (PB), paired noduli (NO), lateral accessory lobe 
(LAL) and gall (Gall). MB: mushroom body. c, Each EBw.s neuron receives 
inputs from an EB wedge and sends outputs to a corresponding PB column, and 
to the gall’*’®. The PB has 18 columns”, but EBw.s neurons only innervate the 
central 16. OddR, EvenR, OddL and EvenL, odd and even PB columns, right 
and left side of brain, respectively. d, GFP-labelled EBw.s neurons in a brain 
counterstained with nc82 (maximum intensity projection (MIP), reproduced 
with permission from Janelia FlyLight Image Database’’). e, MIP of two-photon 
imaging stack (5 frames, 5 μm apart, see Methods) showing EB 
processes of GCaMP6f-labelled EBw.s neurons. f, Top, closed-loop walking 
with a vertical stripe. Bottom, EBw.s activity is measured in 16 regions of 
interest (ROIs). Sample frames from calcium imaging time series (Fly 15) 
showing MIP of EB activity bump (see Methods) as fly rotates visual cue around 
arena (top). F, fluorescence intensity (arbitrary units). Arrows near top of 
h indicate frame times. g, Steps to compute PVA based on EBw.s population 
activity. EB is unwrapped from Wedge 1 to Wedge 16 to display population 
time series in h. Superimposed is PVA estimate that incorporates trial-specific 
offset (m; see Methods). h, EBw.s fluorescence transients during a single trial 
(same trial as f). Colour scale at right. Superimposed brown line indicates PVA 
estimate of angular orientation of visual cue. Top, horizontal greyscale stripe 
shows PVA amplitude; intensity scale at left. i, PVA estimate of angular 
orientation plotted against actual orientation of visual cue (see Methods). 
j, Mean and standard deviation (s.d.) of number of activity bumps in EBw.s 
population activity across trials for each of 15 flies (see Methods). k, Mean and 
s.d. of full width at half maximum (FWHM) of activity bump across trials and 
flies (see Methods). l, Mean and s.d. of correlation between PVA estimate and 
actual orientation (pattern position) (see Methods). m, Mean and s.d. of 
angular offsets between PVA position and pattern position (see Methods, 
Extended Data Fig. 2e, f). All scale bars, 20 μm. 


Visual landmarks dominate over self-motion cues 

Our closed-loop virtual reality experiments directly couple the fly’s 
turning movements to the rotation of the visual scene. To disambigu- 
ate the contributions of visual landmark position and self-motion cues 
to the EBw.s representation of the fly’s orientation, we performed two 
sets of manipulations with a single stripe pattern. First, having 
observed hints of landmark-tethered activity in bump transitions in 
the two-stripe environment (Extended Data Fig. 5), we instanta- 
neously shifted cue position during a period of closed-loop walking 
(Fig. 3a; also see Methods). If EBw.s responses arise from self-motion 
cues rather than landmark orientation, the abrupt shift in landmark 
azimuthal position should not by itself induce matching shifts in 
EBw.s activity. Instead, we observed that EBw.s population activity 
moved to match the cue shift, preserving the initial offset between the 
EBw.s bump and visual cue azimuth (Extended Data Fig. 6a, b, Fig. 3b; 
Supplementary Video 5). Thus, local landmarks rather than self- 
motion cues appear to determine the position of the EBw.s bump. 
Interestingly, the bump did not always move instantaneously. In res- 
ponse to the first landmark shift in Extended Data Fig. 6b (see also 
Supplementary Video 5), for example, the bump rapidly tracks the 
shifted visual landmark, but the second abrupt displacement of the 
landmark elicits a much slower response. 
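
The logic of the cue-shift test can be made concrete with circular arithmetic on the offset between the PVA estimate and the cue position. The following is a minimal sketch under assumed conventions rather than the analysis described in Methods: offsets are taken as PVA minus cue, angles are in degrees wrapped to [-180, 180), and all function names are hypothetical.

```python
def wrap_deg(angle):
    """Wrap an angle in degrees to the interval [-180, 180)."""
    return (angle + 180.0) % 360.0 - 180.0

def offset_deg(pva_deg, cue_deg):
    """Offset between bump position (PVA) and cue position (assumed PVA - cue)."""
    return wrap_deg(pva_deg - cue_deg)

def expected_offset_if_bump_ignores_shift(offset_before, shift_deg):
    """Offset predicted when the cue jumps by shift_deg but the bump stays put."""
    return wrap_deg(offset_before - shift_deg)

def expected_offset_if_bump_follows_cue(offset_before, shift_deg):
    """Offset predicted when the bump moves with the cue: it is unchanged."""
    return wrap_deg(offset_before)
```

With these conventions, a bump driven only by self-motion would change its offset by the size of the shift (60° or 120° in these experiments), whereas a landmark-locked bump preserves its offset.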

As a second manipulation, we varied the closed-loop gain that 
coupled the fly’s rotational movements on the ball to the movement 
of the visual cue (Fig. 3c). If EBw.s activity is determined by the fly’s 
orientation relative to the landmark, the bump should move in lockstep 
with landmark rotation rather than with the fly’s turning movements. 
Indeed, in almost all cases, activity in the EBw.s population faithfully 
followed the visual cue (Fig. 3d-g, Extended Data Fig. 6c). Consistent 
with this, the relationship between walking rotation and EBw.s bump 
movement was strongly dependent on closed-loop gain (Fig. 3h), 
whereas the relationship between visual cue movement and rotation 
of the EBw.s bump scaled only slightly with gain (Fig. 3i). Nevertheless, 
we occasionally observed examples of EBw.s activity being more influ- 
enced by the animal’s rotation than cue movement, particularly in 
situations of low gain (Extended Data Fig. 6d-f). Overall, as has been 
observed in the mammalian head direction system’, the EBw.s compass 
predominantly relies on visual landmarks, but its computation of the 
fly’s orientation is also influenced by the animal’s angular movements. 
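
One way to picture the gain comparison is to unwrap each angular trace into accumulated rotation and take the slope of a linear fit between traces. This is a hedged sketch, assuming a plain least-squares slope as the measure of effective gain; the fitting procedure actually used is described in Methods, and the function names here are hypothetical.

```python
import numpy as np

def accumulated_rotation_deg(angles_deg):
    """Unwrap a wrapped angular trace (degrees) into accumulated rotation."""
    return np.degrees(np.unwrap(np.radians(np.asarray(angles_deg, dtype=float))))

def effective_gain(reference_deg, pva_deg):
    """Slope of accumulated PVA rotation against accumulated reference rotation.

    reference_deg may be the walking rotation or the visual cue rotation.
    Assumption: consecutive samples differ by less than 180 degrees.
    """
    x = accumulated_rotation_deg(reference_deg)
    y = accumulated_rotation_deg(pva_deg)
    slope, _intercept = np.polyfit(x, y, 1)
    return float(slope)

# Synthetic example: closed-loop gain of 0.6, bump locked to the cue
# with an arbitrary 40 degree offset.
walk = np.linspace(0.0, 720.0, 200)   # accumulated walking rotation
cue = 0.6 * walk                      # cue rotation at closed-loop gain 0.6
pva = (cue + 40.0) % 360.0            # bump follows the cue
gain_vs_walk = effective_gain(walk % 360.0, pva)
gain_vs_cue = effective_gain(cue % 360.0, pva)
```

In this synthetic case the gain relative to walking rotation tracks the closed-loop gain while the gain relative to cue rotation stays near 1, which is the pattern expected for a landmark-locked bump.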


Angular path integration with no visual cues 


The influence of self-motion cues on EBw.s population activity 
(Extended Data Fig. 6d-f) suggests that visual information can be 
overridden in determining compass heading. We next sought to 
uncover the contribution of self-motion cues in an environment that 
provided no visual information. A key feature of mammalian head 
direction cells is their ability to retain their compass-like function in 
the absence of visual information using path integration*. We 
searched for evidence of angular path integration, that is, tracking 
of orientation by self-motion cues, by imaging EBw.s population 
responses of flies walking in complete darkness without prior exposure 
to a closed-loop visual scene on the ball (Fig. 4a). As in all visually 
stimulated conditions, EBw.s activity in the dark settled into a single 
bump, and then tracked the fly’s turning movements on the ball 
(Fig. 4b-d; Supplementary Videos 6-8, Extended Data Fig. 7a-e, h). 
However, the PVA estimate of orientation based on the activity in the 
EBw.s population often degraded over time (Fig. 4d, Extended Data 
Fig. 7b, f, h, i), with EBw.s activity not tracking very small or slow 
angular movements (Extended Data Fig. 7g, j, and Extended Data 
Fig. 8). Although the fly’s potentially impaired ability to track its rota- 
tion when tethered on a ball may contribute to the measured drift, it 
likely also reflects a common limitation of path integrators in the 
absence of corrective feedback*”*. We also noted some fly-to-fly vari- 
ability in the effective (measured) gain between ball rotation and EBw.s 
bump movement in these experiments (Extended Data Fig. 7f, i). The 
observation that the EBw.s system can operate at different gains raises 
the possibility that the compass can not only adjust to various visual 
closed-loop gains (as seen in Fig. 3), but also perhaps tune the gain 
between ball rotation and EBw.s activity through experience. However, 
prior exposure to specific closed-loop gains in visual surroundings had 
only a negligible influence on the effective gain between ball rotation and 
EBw.s activity in darkness (Extended Data Fig. 9). Overall, these results 
show that the EBw.s population performs angular path integration in 
darkness by relying exclusively on self-motion cues, albeit with a grad- 
ual accumulation of error in its orientation estimate. 


Orientation represented by persistent activity 


Having established that both visual landmark information and self- 
motion cues contribute to EBw.s population activity, we next asked 
how the system responds to the absence of both sources of orientation 
information. Specifically, we examined EBw.s activity during epochs 
when the fly stopped walking while in the dark. In almost all such cases, 
the EBw.s population maintained a representation of the fly’s orientation 
(Fig. 4e–i; Extended Data Fig. 10a–f; Supplementary Videos 7 and 
8). This representation persisted even when EBw.s calcium activity was 
low, as evident in the fact that renewed bouts of movement caused a 
bump to reappear in exactly the wedges expected based on the last 
orientation of the fly (Fig. 4h and Extended Data Fig. 10a, c, e, 
Supplementary Videos 7 and 8). This representation of orientation 
sometimes persisted for more than 30 s (Fig. 4i, Extended Data 
Fig. 10b, d, f). Such persistence was also a feature of EBw.s dynamics 
when the fly remained standing in a visual environment (Extended 
Data Fig. 10g–r), extending beyond durations that elicit adaptation in 
early visual circuits”’. Thus, even in the absence of self-motion cues, EBw.s 
population activity maintains a stable representation of the fly’s 
orientation in its environment with or without visual landmarks. 

Figure 3 | EBw.s activity tracks landmark orientation cues over angular 
rotation when these cues are in conflict. a, In cue-shift experiments, fly is in 
closed-loop control of stripe position until cue is abruptly shifted to new 
position (see Methods). b, Offset between PVA estimate and actual orientation 
relative to visual cue before and after cue shift. Plot compares actual offsets with 
those expected if EBw.s activity did not follow cue position (see Methods). 
N = 50 shifts (n = 6 flies), r = 0.85, p_r = 0. Fit: slope = 0.78 ± 0.07, p_slope = 0, 
R² = 0.72. c, In closed-loop gain-change experiments, ball rotation drives 
movement of visual stimulus with different closed-loop gains. d, Fluorescence 
transients during trial with low gain of 0.6 (Fly 15 from Fig. 1). e, Comparison 
of PVA estimate versus accumulated rotation of visual cue and walking rotation 
on ball (trial in d). Walking rotation exceeds visual cue angular rotation in this 
low-gain trial. f, Similar to d, but with high closed-loop gain of 1.3 (Fly 3 from 
Fig. 1). g, Similar to e, but with high gain (trial in f). h, Effective gain between 
walking rotation and PVA estimate for different closed-loop gains (r = 0.69, 
p_r = 0, Fit: slope = 0.85 ± 0.07, p_slope = 0, n = 172, R² = 0.48, see Methods). 
i, Effective gain between visual cue rotation and PVA estimate for different 
closed-loop gains (r = 0, p_r = 15.1 X 10>, Fit: slope = −0.17 ± 0.07, 
p_slope = 0.02, n = 172, R² = 0.03, see Methods). 
Discussion 


The ability of animals to combine continuous path integration with 
potentially intermittent landmark-based orienting enables navigation 
in a wide diversity of environmental conditions'’. We studied the activity 
dynamics of a complete population of identified CX neurons in tethered 
walking flies and found that this network uses information from both 
landmark-based and angular path integration systems to create a com- 
pass-like representation of the animal’s orientation in the environment. 

Previous studies have described static visual maps in the CX’**”*. 
Such maps may allow navigating insects to maintain a sun-compass- 
based heading direction’*"**’*°. Here we found that EBw.s neurons 
track the fly’s orientation relative to visual landmarks in a variety of 
different visual environments (Figs 1 and 2), suggesting that the CX 
dynamically adapts to estimate the fly’s orientation within its visual 
surroundings (Extended Data Figs 2d, 3d and 4g). Subsets of ring 
neurons are likely to bring information about spatially localized visual 
features” to specific rings of the ellipsoid body”’. It is not yet clear how 
this information is converted into an abstract and flexible representa- 
tion of the animal’s orientation relative to landmarks*”, but EBw.s 
responses in a symmetric environment with two indistinguishable 
cues (Extended Data Figs 4 and 5) hint at an underlying winner- 
take-all process for landmark selection*’. Combining landmark ori- 
entation with information about the animal’s movement effectively 
creates an internal reference frame for the animal in its surroundings. 
Many of the proposed functions of the CX in directed locomotion'’”’, 
visual place learning'®, and action-selection™, may rely on this 
internal reference. Although the EBw.s population tracks the fly’s 
rotational movements in darkness, we do not yet know where and 
how translational motion, an important component of a complete 
navigational system, is incorporated. Additionally, although the cal- 
cium sensor we chose for our imaging experiments has the temporal 
resolution necessary to capture EBw.s representations of the fly’s 
angular rotation (see Methods), it lacks the precision necessary for 
us to determine whether EBw.s activity represents the fly’s predicted 
future orientation or its estimate of current orientation. 

Our observation that EBw.s activity was maintained in the absence 
of self-motion suggests that internal dynamics play a significant role in 
shaping neural activity in the fly brain, much as they do in the brains of 
larger animals. Persistent activity in the CX can maintain compass 
information when the fly is standing in darkness for 30 s—two orders 
of magnitude longer than might be explained by calcium sensor decay 
kinetics'*. Persistent activity has been shown to support maintenance 
of eye position in the goldfish* and has been proposed to underlie 
working memory in mammals”. In the CX, this activity may allow the 
fly to retain a short-term orientation memory even when landmarks 
are temporarily out of sight®. Consistent with this notion, the EBw.s 
activity bump largely remained tethered to the position of one land- 
mark even in the presence of another identical landmark in front of the 
fly (Extended Data Fig. 4i). The bump also did not always shift instan- 
taneously following an abrupt displacement of visual landmarks, as if 
temporarily retaining the original orientation reference before locking 
on to its new position (Extended Data Fig. 6b). 

Several models have been proposed to explain how visual land- 
mark and self-motion cues are integrated at the level of head dir- 
ection cell activity in mammals”’. Most rely on circuits organized as 
ring attractors: neurons are schematized as being arranged in a 
circle based on their preferred directions**, with connection 
strengths that depend on their angular separation’. With initial 
sensory input and an appropriate balance of recurrent excitation 
and inhibition, such a circuit can generate and sustain a localized 
activity bump. The bump’s position on the circle corresponds to 
the animal’s heading, which is then updated by directional drive 
from self-motion signalling neurons. Direct experimental evidence 
in support of these models has been difficult to obtain in mammals 
owing to the distributed nature of the underlying circuits. 
Although the functional connectivity between EBw.s neurons is 
not yet known, we have observed several of the expected features of 
ring attractor models*””’° in the dynamics of this population of 
CX neurons: organization of activity into a localized bump, movement 
of the bump to neighbouring wedges based on self-motion, 
drift in bump location in darkness, persistent activity, and both 
abrupt jumps and gradual transitions of the activity bump when 
triggered by strong visual input. Cell-intrinsic mechanisms could 
also underlie some of these features, including, for example, 
persistent activity*'’. The genetic tools available in Drosophila to 
target and manipulate the activity of identified cell types should 
allow different models for visually guided orientation and angular 
path integration to be discriminated at the level of synaptic, cellular 
and network mechanism. 

Figure 4 | Path integration, drift and 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 

Fly stocks. All calcium imaging experiments were performed with 8–11-day-old 
female UAS-GCaMP6f;R60D05-GAL4 flies. Flies were randomly picked from 
their housing vials for all experiments. 

Nomenclature. EBw.s neurons are referred to variously as eb-pb-vbo”*, EIP”° and 
EBw.s-PBg.b-gall.b** neurons in the fly literature. They may be homologous to 
CL1a neurons in the locust” and butterfly’*. 

Fly preparation for imaging during walking. The fly was anaesthetized on ice 
and transferred to a cold plate at 4 °C. The fly’s proboscis was pressed onto its 
head and immobilized with wax. To maximize the fly’s field of view we used a 
two-piece pyramidal stainless steel shim holder’ similar to those previously used 
for tethered flying fly experiments”. The fly was glued to a pin and positioned in 
the holder using a micromanipulator and fixed in the holder with UV gel as 
previously described’””®. The fly body axis was angled at 31° ± 5° (measured 
for 5 flies) with respect to the tracking system to orient the EB optimally with 
respect to the microscope’s focal plane. To stop brain movement due to the 
pulsation of muscle M16, we cut the muscle—or the nerves innervating the 
muscle—with dissection needles if necessary. The fly holder (including the micro- 
manipulator) was then transferred to the two-photon microscope and secured 
using magnetic mounts. As previously described’*”, the fly was positioned on an 
air-supported ball with a three-axis micromanipulator and the walking velocity of 
the fly was monitored using a camera system. For all experiments described in the 
main figures we used a 6-mm diameter, 40-mg ball’. For the experiments in the 
dark described in Extended Data Fig. 7h-j, we used a 10-mm diameter, 175-mg 
ball. All balls were made of polyurethane foam. 

For experiments with visual cues, we removed parts of the antennae (funiculus 
and arista) to reduce the fly’s tendency to touch the holder. 

For experiments in which flies walked in the dark we additionally coated the 
eyes of a subset of flies with black paint (Premiere Acrylic Colours, Mars Black). 
For the 6-mm-ball experiments, Flies 4-11 had coated eyes, while Flies 2-13 had 
coated eyes for the 10-mm-ball experiments. The number of trials per fly was as 
follows (Fly (number of trials)). 6-mm ball: 1(15), 2(8), 3(8), 4(8), 5(11), 6(10), 
7(8), 8(10), 9(17), 10(10), 11(11). 10-mm ball: 1(7), 2(3), 3(8), 4(12), 5(10), 6(8), 
7(6), 8(5), 9(3), 10(14), 11(8), 12(6), 13(11). All trials across all conditions 
lasted 140 s. 
Two-photon calcium imaging. Calcium imaging was performed using a custom- 
built two-photon microscope controlled with ScanImage 4.2%. We used an 
Olympus ×40 objective (LUMPlanFl/IR, NA 0.8) and a GaAsP photomultiplier 
tube (H7422PA-40, Hamamatsu). A Chameleon Ultra II laser (Coherent, Santa 
Clara, CA) tuned to 920 nm was used as the excitation source with the power 
adjusted to below 20 mW at the sample. We used the same saline as in previous 
studies”? but adjusted the calcium concentration to 2.5 mM. Focal planes were 
selected to optimize coverage of the part of the EB innervated by EBw.s neurons. 
We imaged 5-plane volumes at a rate of 8.507 Hz with an equal spacing of 
between 4 μm and 6 μm between individual focal planes. 

The calcium signals we measured may reflect synaptic input, action potential 
output or some combination of both. We chose GCaMP6f"* for our experiments 
because it offers the temporal resolution necessary to capture EBw.s representa- 
tion of the fly’s angular rotation. Based on in vivo measurements of responses to 
20 Hz spiking at the Drosophila larval neuromuscular junction, GCaMP6f has a 
time-to-peak of ~141 ms (close to the 8.507 Hz frame rate of our imaging system) 
and a decay time of ~380 ms (ref. 18). Assuming that one complete rotation of the 
fly is represented by activity moving through the 16 wedges of the EB, each wedge 
represents 22.5° of rotation. The maximum average rotational velocity reached by 
a fly in our experiments was ~35° s⁻¹ (Extended Data Fig. 1), which would result 
in a bump of activity moving across a wedge no faster than ~640 ms on average. 
Thus, possible lags in the calcium signals introduced by the rise and decay times of 
GCaMP6f would not compromise the detection of these activity changes. 
Although we do not know the actual change in electrical activity associated with 
the calcium transients we see, the kinetics of GCaMP6f provide a considerable 
margin of error. 
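
The timing argument above can be checked with a few lines of arithmetic. The sketch below only restates numbers given in the text (16 wedges, ~35° s⁻¹, 8.507 Hz, ~141 ms time-to-peak, ~380 ms decay); the variable names are illustrative and not taken from any analysis code.

```python
N_WEDGES = 16                 # wedges tiling one full rotation
MAX_MEAN_VELOCITY = 35.0      # deg/s, maximum average rotational velocity
FRAME_RATE_HZ = 8.507         # volume imaging rate
TIME_TO_PEAK_S = 0.141        # GCaMP6f time-to-peak (ref. 18)
DECAY_S = 0.380               # GCaMP6f decay time (ref. 18)

deg_per_wedge = 360.0 / N_WEDGES                       # 22.5 degrees
wedge_transit_s = deg_per_wedge / MAX_MEAN_VELOCITY    # about 0.64 s
frame_period_s = 1.0 / FRAME_RATE_HZ                   # about 0.12 s
frames_per_wedge = wedge_transit_s / frame_period_s    # volumes per transit

# The bump dwells on a wedge longer than either sensor time constant.
sensor_is_fast_enough = wedge_transit_s > max(TIME_TO_PEAK_S, DECAY_S)
```

At the fastest average turning speed the bump therefore spends roughly 640 ms on each wedge, which spans several imaging volumes and exceeds both GCaMP6f time constants quoted above.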

Visual stimulation. Visual arena. Visual stimuli were presented on a cylindrical 
LED display* spanning 270° in azimuth and 120° in elevation, and tilted by 10° 
towards the fly. The display was covered with a colour filter and a diffuser as 
previously described'?”°. 

Visual stimuli for closed-loop walking experiments. We used three different visual 
stimuli: condition 1, a bright vertical stripe spanning 120° in elevation and 15° in 
azimuth; condition 2, two bright stripes of the same dimensions separated by 135° 
(resulting in a pattern that was invariant to rotations by 135°); and condition 3, 
a pattern containing several vertical and horizontal stripes (Extended Data 
Fig. 1a–c). The number of trials (in brackets) for each fly for each of these 
conditions was: trial 1, 1(7), 2(6), 3(9), 4(11), 5(15), 6(13), 7(3), 8(7), 9(7), 10(6), 11(5), 
12(9), 13(9), 14(10), 15(6); trial 2, 1(12), 2(8), 3(10), 4(3), 5(6), 6(8), 7(16); trial 3, 
1(6) (same as fly 1 in trial 2), 2(2) (same as fly 2 in trial 2), 3(5) (same as fly 4 in 
trial 2), 4(3), 5(7) (same as fly 5 in trial 2), 6(11), 7(4), 8(11), 9(12). 

In cue-shift experiments each trial consisted of two cue shifts by the same 
angular distance within each trial (either by 60° or 120°) after at least 50 s of 
closed-loop behaviour. The first cue shift was counter clockwise from the current 
position and the next, 50 s later, clockwise by the same angular amount from the 
current position. Cue shift experiments were performed with a subset of the flies 
in condition 1. A 60° cue shift was used for flies 7(2), 11(3), 13(3), 15(2) and 
a 120° cue shift for flies 7(2), 8(3), 9(1), 11(3), 13(3), 15(2). 

In experiments that tested the influence of prior exposure to visual stimuli in 
closed loop on the gain between walking rotation and PVA estimate in darkness 
(Extended Data Fig. 9), we exposed the flies to 65 s of closed-loop walking with 
either low gain (mean closed-loop gain = 0.47 ± 0.04, close to the fly’s default 
gain on the ball without prior closed-loop walking experience) or higher gain 
(mean set gain = 0.9 ± 0.16) with a single stripe, after which the stripe 
disappeared and the trial continued for another 60 s in darkness. For these experi- 
ments, we only used flies that showed strong rotational movement with low 
drift—as assessed at the onset of the experiment—to increase the accuracy of 
the gain calculation. We only recorded a small number of trials per fly, because the 
fly usually rotated more at the onset of experiments and walked forward more 
towards the end, leading to increased drift. For a subset of flies, we also tested the 
intrinsic gain of the flies walking in darkness without prior exposure to the closed- 
loop stimulus. The number of trials for experiments in which we combined 
closed-loop walking and walking in darkness were (fly number (trials with dis- 
appearing stripe/trials in darkness before exposure to closed-loop condition)): 
1(5/0), 2(4/1), 3(3/1), 4(5/0), 5(7/1), 6(3/2), 7(4/1), 8(6/2), 9(5/1), 10(4/1), 
11(4/1), 12(3/2), 13(5/2), 14(5/1), 15(1/2), 16(5/2), 17(3/2), 18(6/2), 19(4/2), 20(4/2), 
21(5/2), 22(4/2), 23(4/1), 24(2/2), 25(4/2), 26(3/2). 

Closed-loop gains to convert rotation on the ball to displacement of the stripe 
around the arena were close to 1 (‘normal gain’), but were varied from 0.4 to 1.6 in 
experiments to explore the effect of gain change on EBw.s representation. Actual 
values of the gain were verified by fitting changes in ball displacement to changes 
in pattern displacement on the arena. All patterns were displaced directly from 
one edge of the 270° arena to the other behind the fly rather than having them 
progress virtually through the 90° of visual field not represented by the arena. 
This was done to prevent abrupt changes in light intensity and to keep the number 
of features in the fly’s visual field constant. 

All experiments with visual stimulation were performed in closed loop**. The 
voltage position signal of the tracking system was read with a DAQ board and 
discretized in 20-ms intervals using custom LabVIEW software which was also 
used to update the position of the visual stimulus”. 

Data analysis. We used MATLAB (MathWorks, Inc., Natick, MA) and the 
Circular Statistics Toolbox” for data analysis. All errors and error bars shown 
are standard deviation (s.d.). No statistical methods were used to predetermine 
sample size. 

Calculation of fluorescence changes. Each imaged volume (stack of five frames) 
was averaged for analysis—we refer to this average as a ‘frame’. Each frame was 
spatially filtered with a 2-pixel-wide Gaussian filter after which background 
fluorescence was subtracted. Calcium transients recorded from behaving flies 
were smoothed with a 3rd order Savitzky-Golay filter over 7 frames (822 ms) 
for comparisons with behavioural data. The baseline for calculating ΔF/F was 
computed by averaging over the 10% of lowest-intensity frames in each trial. For 
display only, MIP fluorescence intensity images shown in Figs 1e, f, 2a, 4e, and 
Extended Data Fig. 5a were filtered with a 20-pixel-wide, 10-pixel-s.d. Gaussian 
filter (the size of each image is 216 pixels by 216 pixels). 
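
As an illustration of the baseline rule described above, the following minimal sketch computes ΔF/F for a single trace, taking the mean of the lowest-intensity 10% of frames as the baseline. The function name and the rounding of the frame count are our assumptions; spatial filtering, background subtraction and Savitzky-Golay smoothing are not shown.

```python
def delta_f_over_f(trace, baseline_fraction=0.10):
    """Return dF/F for one fluorescence trace (one value per frame).

    The baseline F0 is the mean of the lowest `baseline_fraction` of frames
    in the trial. Rounding to a whole number of frames is an assumption.
    """
    n_low = max(1, round(baseline_fraction * len(trace)))
    baseline = sum(sorted(trace)[:n_low]) / n_low
    return [(f - baseline) / baseline for f in trace]


# Toy trace: 10 dim frames followed by 90 frames at twice the intensity.
example = [1.0] * 10 + [2.0] * 90
dff = delta_f_over_f(example)
print(dff[0], dff[-1])
```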

ROI selection. ROIs corresponding to 16 wedges of the EB were selected manually 
in videos of ΔF/F by drawing an ellipse (with a central hole, as depicted in 
Fig. 1f, g) that surrounded the EB, and then equally subdividing the ellipse into 16 
wedges each spanning 22.5°. The number of wedges was selected based on the 
well-characterized EB wedge and PB glomerular innervation patterns of EBw.s 
neurons labelled by the R60D05-Gal4 line**. Some EBw.s neurons are known to 
arborize only in half- or demi-wedges**. Thus, our ROI selection and population 
analysis strategy may underestimate the actual resolution of the EBw.s system. 

Population vector average (PVA). The PVA was computed as the weighted 
average of EB wedge angles ranging from 0 to 360°, with average ΔF/F values 
for each wedge used as a weight. This PVA estimate was smoothed with a box-car 
filter over 3 frames (352 ms). We used brewermap (S. Cobeldick, MathWorks file 
exchange) with colour schemes from http://colorbrewer2.org/ to generate colour 
maps for all PVA plots except for PVA amplitude, which we display in greyscale. 
For display of PVA estimates of orientation or walking rotation, raw PVA was 
offset by the median difference (circular distance) between PVA and either the 
visual cue position (for closed-loop trials in the arena) or the walking rotation 
signal (for trials in the dark). We computed the offset using epochs of walking in 
the final 80% of a trial, a period during which PVA estimates were typically more 
stable. The offset adjustment was necessitated by the fact that there was no 
stereotyped relationship between cue positions and EBw.s signal across flies 
(Fig. 1m, Extended Data Fig. 2e, f, Fig. 2f, Extended Data Figs 3e, f, 4j, l, m). 
The magnitude of the offset in many animals (Fig. 1m, Extended Data Figs 2e, f, 
3e, f, 4j, l) greatly exceeded the slight variance in the angle at which the tethered 
fly’s head was fixed relative to the LED arena. The offset also occasionally changed 
between trials for the same fly. We did not monitor the fly’s walking between 
trials, leaving open the possibility that these differences in offset arose purely from 
rotational movements of the fly (in the absence of closed-loop visual feedback) 
before initiation of the next closed-loop trial. 
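
A minimal sketch of a population vector average for one imaging frame is given below. It assumes the weighted average is taken as a circular (vector-sum) mean, that wedge angles are the wedge centres, and that amplitude is the resultant length normalized by the summed weights; none of these conventions is specified in the text, and the function names are illustrative.

```python
import cmath
import math


def population_vector_average(dff_per_wedge):
    """Return (angle in degrees, amplitude) of the weighted circular mean."""
    n = len(dff_per_wedge)
    # Assumed convention: wedge k is centred at (k + 0.5) * 360 / n degrees.
    centres = [2.0 * math.pi * (k + 0.5) / n for k in range(n)]
    vec = sum(w * cmath.exp(1j * a) for w, a in zip(dff_per_wedge, centres))
    total = sum(dff_per_wedge)
    angle_deg = math.degrees(cmath.phase(vec)) % 360.0
    amplitude = abs(vec) / total if total > 0 else 0.0
    return angle_deg, amplitude


def circular_distance(a_deg, b_deg):
    """Signed shortest angular difference a - b, in [-180, 180)."""
    return (a_deg - b_deg + 180.0) % 360.0 - 180.0


frame = [0.0] * 16
frame[0] = 1.0                      # activity confined to the first wedge
print(population_vector_average(frame))
print(circular_distance(350.0, 10.0))
```

The per-trial offset described above would then be the median of `circular_distance(pva, cue)` over the walking epochs of interest.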

Analysis of number and width of activity bumps. All ROIs with calcium tran- 
sients above a set threshold were included in an activity bump. Each contiguous 
set of ROIs above threshold defined an individual bump. We used two separate 
methods to set the threshold. In Method 1 (used for all the main figures), the 
threshold was defined as 1-s.d. above the mean of calcium transients across ROIs 
for each imaging frame. In Method 2 (see Extended Data Figs 2, 3, 4 and 7), the 
threshold was defined as the mean of calcium transients across ROIs over the 
entire trial. The width of a bump in each frame was, in all cases, defined as the full 
width at half maximum (with minimum in each frame subtracted). We used the 
Kolmogorov-Smirnov two-sample test for tests of the null hypothesis that bump 
widths for two different stimulus conditions are drawn from the same distri- 
bution. P values for this test are shown in the relevant figure legends. 
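
The bump-counting rule of Method 1 can be sketched as follows for a single frame. Treating the 16 ROIs as a ring (so that a bump may span the last and first wedge) and using the population standard deviation are our assumptions; the width (FWHM) calculation is not shown.

```python
import statistics


def count_bumps(dff_per_wedge, n_sd=1.0):
    """Count contiguous runs of ROIs above mean + n_sd * s.d. on a ring."""
    mean = statistics.fmean(dff_per_wedge)
    sd = statistics.pstdev(dff_per_wedge)  # population s.d.: an assumption
    threshold = mean + n_sd * sd
    above = [v > threshold for v in dff_per_wedge]
    if all(above):
        return 1
    # A bump starts wherever an above-threshold ROI follows a below-threshold
    # one; index -1 wraps around, closing the ring.
    return sum(1 for i in range(len(above)) if above[i] and not above[i - 1])


frame = [0.0] * 16
frame[15] = frame[0] = 1.0          # one bump straddling the wrap-around
print(count_bumps(frame))
```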

Offset between pattern position and PVA estimate. The offset between the 
pattern position and the PVA estimate was calculated as the circular distance 
between the PVA and the leftmost pixel of the pattern in Extended Data Fig. 1a-c 
across the entire trial for Extended Data Figs 2e, 3e and 4l, and averaged over all 
trials for each fly in Figs 1m and 2f and Extended Data Fig. 4j. The s.d. of the offset 
was calculated as the circular s.d.”’ of the offset signal in each trial, and averaged 
across all trials. The pattern position from 0° to 270° was mapped to 0° to 360°, as 
explained below. 

Mapping of 270° visual arena onto 360° EB. To compute the mapping of the 
visual pattern onto the EB we calculated the gain—slope of a linear fit—between 
the unwrapped (see below) pattern position and the unwrapped PVA estimate for 
those trials in which the pattern moved over at least half the display. Since EBw.s 
activity in response to cues on the 270° LED arena was uniformly mapped to 360° 
of the EB (Extended Data Figs 2d, 3d, 4g), visual cue positions on the 270° arena 
were mapped linearly to an arena spanning 360° in all plots and analyses to 
facilitate comparisons of cue position with PVA estimates and walking rotation. 
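
The mapping and gain calculation described above can be illustrated with a short sketch: arena positions are rescaled from 0-270° to 0-360°, both series are unwrapped, and the gain is the slope of a least-squares line. Function names are illustrative, and the unwrapping rule (each step taken as the shortest angular path) is our assumption.

```python
def map_arena_to_ring(pos_deg, arena_span_deg=270.0):
    """Linearly rescale an arena position onto a 360-degree ring."""
    return pos_deg * 360.0 / arena_span_deg


def unwrap_deg(angles_deg):
    """Accumulate shortest-path steps so the series has no 360-degree jumps."""
    out = [angles_deg[0]]
    for prev, cur in zip(angles_deg, angles_deg[1:]):
        step = (cur - prev + 180.0) % 360.0 - 180.0
        out.append(out[-1] + step)
    return out


def slope(x, y):
    """Slope of the ordinary least-squares line y = a + b * x."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / sxx


# Toy example: the pattern sweeps once around the arena and wraps to 0.
arena_pos = [0.0, 67.5, 135.0, 202.5, 0.0]
pva_wrapped = [0.0, 90.0, 180.0, 270.0, 0.0]
pattern = unwrap_deg([map_arena_to_ring(p) for p in arena_pos])
print(pattern)
print(slope(pattern, unwrap_deg(pva_wrapped)))
```

In this toy case the slope is 1, which corresponds to the 270° arena mapping onto the entire ring of the EB.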

Walking behaviour analysis. Ball movement was recorded at a sampling rate of 4 
kHz". Ball displacement and stimulus position were down-sampled to match the 
corresponding two-photon scan rate (8.507 Hz). Velocities in Extended Data Fig. 
1 were calculated over 20 frames (2.35 s) and averaged over the entire trial. 
Walking traces were subdivided into walking and standing epochs—only epochs 
that lasted at least 20 frames were considered for such classification. We only 
labelled epochs as ‘standing’ if the fly was standing for at least 20 successive 
imaging frames (2.35 s). 
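
The epoch rule above (runs shorter than 20 frames are left unclassified) can be sketched as a run-length pass over a per-frame walking flag. How that flag is derived from ball velocity is not stated in the text, so the boolean input here is an assumption, as are the function and label names.

```python
from itertools import groupby


def label_epochs(is_walking, min_frames=20):
    """Return (label, start, stop) for runs of at least `min_frames` frames.

    `is_walking` holds one boolean per imaging frame; `stop` is exclusive.
    Shorter runs are dropped rather than merged with their neighbours.
    """
    epochs = []
    start = 0
    for state, run in groupby(is_walking):
        length = len(list(run))
        if length >= min_frames:
            label = "walking" if state else "standing"
            epochs.append((label, start, start + length))
        start += length
    return epochs


flags = [True] * 25 + [False] * 5 + [True] * 30 + [False] * 20
print(label_epochs(flags))
```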

Correlation analysis. Pearson’s correlation coefficients were computed between 
two entire ‘unwrapped’ time series, which is the cumulative sum of all angular 
displacements. For display only, we ‘wrapped’ the angular data into the −π to π 
range. Only trials in which the fly walked for more than 30% of the time were used 
in summary plots of correlations and gains. For Fig. 1, we can reject the null 
hypothesis that true single-trial correlations are 0 with P < 0.05 for all trials 
except for 1/12 trials of Fly 3, 1/15 trials of Fly 5 and 1/9 trials of Fly 13. For 
these trials, the correlations are 0.028, 0.053 and 0.024 respectively. For Fig. 2g 
(multiple features) and Extended Data Fig. 4k (two stripes), P values for correla- 
tions for all trials of all flies are <0.05. For Extended Data Fig. 7b (walking in 
darkness), P > 0.05 for the correlations for only the following trials: 1/8 trials of 
Fly 3, 1/8 trials of Fly 4, 1/9 trials of Fly 6, 1/10 for Fly 8, 1/7 for Fly 9, 1/10 for Fly 
10. For these trials, correlations are 0.056, 0.037, −0.023, −0.054, 0.039, and 
−0.026 respectively. 
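
To make the distinction between 'unwrapped' and 'wrapped' series concrete, the sketch below builds an unwrapped series as the cumulative sum of per-frame angular displacements, computes Pearson's r between two such series, and wraps a value for display. Working in radians and wrapping into the half-open interval [−π, π) are our assumptions; the names are illustrative.

```python
import math
from itertools import accumulate


def unwrapped_from_steps(steps_rad):
    """Cumulative sum of per-frame angular displacements (radians)."""
    return list(accumulate(steps_rad))


def wrap_to_pi(angle_rad):
    """Wrap an angle into [-pi, pi) for display."""
    return (angle_rad + math.pi) % (2.0 * math.pi) - math.pi


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length series."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


walk_steps = [0.1, 0.2, -0.05, 0.3, 0.25]
pva_steps = [0.5 * s for s in walk_steps]     # toy PVA with a gain of 0.5
walk = unwrapped_from_steps(walk_steps)
pva = unwrapped_from_steps(pva_steps)
print(pearson_r(walk, pva))
print(wrap_to_pi(4.0))
```

Note that a correlation of 1 does not imply a gain of 1: in the toy example the PVA follows walking rotation perfectly but at half the amplitude.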

Computation of gains. Closed-loop gains that translated ball rotation into move- 
ments of visual patterns on the LED arena were set to fixed values for each trial. 
However, differences in infrared lighting conditions affected the optical mouse 
sensor chip system’s computation of ball rotation slightly’, resulting in small 
variations in effective closed-loop gains. To compute true gain values, the ball’s 
rotation about the vertical axis was linearly fit against the pattern position. The 
slope of this line was considered the actual gain for the trial. 

The relationship between rotation of activity around the EB and either walking 
rotation or visual cue rotation on the arena was captured by a linear fit. The gain 
between EBw.s activity rotation and either behaviour or stimulus was computed 
as the slope of this line. 

In all cases above, gains were computed across 200 frames (23.51 s) or over the 
entire walking epoch for data in Fig. 3h, i. Two-dimensional distributions of 
correlation values for flies walking in darkness were computed using a window 
of 200 frames sliding along the time series in steps of 25 frames (2.94 s). For 
Fig. 3h, i we only included walking epochs for which the visual cue moved over at 
least half of the display, and calculated gains over the entire walking bout. 
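
The windowing scheme above can be sketched as follows. The helper names are ours; the frame rate, window length and step size come from the text. The gain or correlation for each window would then be computed on the slice of the unwrapped series that the window indexes.

```python
FRAME_RATE_HZ = 8.507  # two-photon volume rate quoted in the text


def frames_to_seconds(n_frames, frame_rate_hz=FRAME_RATE_HZ):
    """Duration of `n_frames` imaging frames in seconds."""
    return n_frames / frame_rate_hz


def sliding_windows(n_frames, window=200, step=25):
    """(start, stop) index pairs of full windows; `stop` is exclusive."""
    return [(s, s + window) for s in range(0, n_frames - window + 1, step)]


print(round(frames_to_seconds(200), 2))   # window length in seconds
print(round(frames_to_seconds(25), 2))    # step size in seconds
print(sliding_windows(250))
```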

Comparison of angular velocity with PVA-estimated velocity. For Extended 
Data Fig. 8, we computed angular velocity and PVA-estimated angular velocity 
using a 20-frame window (2.35 s) and plotted the values against each other for all 
trials in darkness for each fly. Points were then coloured based on the mean PVA 
amplitude during the 20-frame epoch. 

Analysis of persistent activity. To compare changes in PVA estimates during 
periods when the fly was not walking, we selected epochs of alternating walking- 
standing-walking bouts, with walking bouts each lasting at least 5 frames (588 ms) 
and non-walking bouts lasting at least 20 frames (2.35 s), well beyond the per- 
sistence of calcium signals attributable to the decay kinetics of the indicator. All 
values were averaged over 5 frames (588 ms) before or after the stop and restart of 
walking, respectively. 

Analysis of responses to cue shifts. Changes in the offset between the visual cue 
position (the leftmost pixels of the cue seen from the fly’s perspective) and the 
PVA estimate were computed as the mean circular distance over 100 frames 
(11.76 s). We compared the 100-frame mean offset before the first cue jump to 
the offset before the second cue jump, and the offset before the second cue jump to 
the offset at the end of the trial. For comparison, we also computed the expected 
change in PVA-cue offset if the PVA were not to follow visual cue position, in 
which case the PVA-cue offset would change by the magnitude of the cue jump. 

Extended Data Figure 1 | Visual stimuli, walking velocities and fraction of 
time walking across flies and conditions. a, Single-stripe pattern. b, Pattern 
with multiple features. c, Pattern with two identical stripes positioned 
symmetrically on the 270° visual display. In all closed-loop experiments, visual 
stimuli wrapped around the 270° arena, going directly from 0° to 270° and vice 
versa. d-g, Walking performance during closed-loop walking with a single 
stripe: d, forward velocity; e, magnitude of sideslip velocity; f, magnitude of 
rotational velocity; g, fraction of time walking across all trials. h-k, Same as 
d-g for the pattern with multiple features. l-o, Same as d-g for pattern with two 
stripes. p-s, Same as d-g for walking in the dark on a 6-mm diameter ball. 
t-w, Same as d-g for walking in the dark on a 10-mm diameter ball. x-aa, same 
as d-g for experiments with trials that combined epochs of closed-loop walking 
with epochs of walking in darkness (Extended Data Fig. 9). 

Extended Data Figure 2 | Closed-loop walking in visual environment with 
single stripe pattern. a, Mean and s.d. of the number of activity bumps as 
measured by Method 2 (see Methods) during all trials of all flies shown in Fig. 1. 
b, Mean and s.d. of the number of successive calcium imaging frames (recorded 
at 8.507 Hz) with more than one bump, measured using Method 1 (see 
Methods), for all flies shown in Fig. 1. c, Same as b, but computed using Method 
2. d, Histogram of slopes of the linear fit between PVA estimate and pattern 
position during walking epochs, that is, the gain between unwrapped PVA 
estimate and unwrapped pattern position. The pattern was mapped from 0°-to-270° 
to 0°-to-360° for PVA calculations (see Methods). Thus, a slope of 1 
corresponds to a visual pattern on the 270° arena that maps to the entire ring of 
the ellipsoid body. Only those walking epochs during which the pattern moved 
over at least half of the visual display were included so as to obtain an accurate 
estimate of the slope (mean slope = 0.92 ± 0.32, n = 172 walking epochs, see 
Methods). e, Mean and s.d. of angular offsets between PVA position and 
pattern position for each trial (140 s, see Methods) for all flies. f, Mean and s.d. 
of s.d. of angular offset between PVA position and pattern position. 

Extended Data Figure 3 | Closed-loop walking in visual environment with 
multiple features. a, Mean and s.d. of the number of activity bumps as 
measured by Method 2 (see Methods) during all trials of all flies shown in Fig. 2. 
b, Mean and s.d. of the number of successive calcium imaging frames with more 
than one bump, measured using Method 1 (see Methods), for all flies shown in 
Fig. 2. c, Same as b, but computed using Method 2. d, Same as Extended Data 
Fig. 2d for the pattern with multiple features (mean slope = 0.97 ± 0.43, n = 74 
walking epochs). e, Mean and s.d. of angular offsets between PVA position and 
pattern position for each trial (140 s) for all flies. f, Mean and s.d. of s.d. of 
angular offset between PVA position and pattern position. 

Extended Data Figure 4 | Single activity bump during closed-loop walking 
in visual environment with two stripes. a, Closed-loop experiment in visual 
environment with two identical and symmetrically placed stripes. b, Mean and 
s.d. of number of bumps in EBw.s population activity across trials for each of 7 
flies. c, Mean and s.d. of FWHM of bump. Distribution of bump widths is 
significantly different from that for single-stripe stimulus (Fig. 1k); P = 4.5 
10 ° (see Methods), mean width = 78.7° ± 15.6° for two-stripe trials versus 
82.3° ± 11.5° for single-stripe trials. d, Mean and s.d. of the number of activity 
bumps as measured by Method 2 (see Methods) during all trials for all flies. 
e, Mean and s.d. of the number of successive calcium imaging frames with more 
than one bump, measured using Method 1 (see Methods). f, Same as e, but 
computed using Method 2. g, Same as Extended Data Fig. 2d for the pattern 
with two stripes (mean slope = 1.08 ± 0.41, n = 96 walking epochs). h, EBw.s 
fluorescence transients during trial with two-stripe pattern (Fly 2 in b). i, PVA 
estimate of the fly’s angular orientation compared to actual orientation. j, Mean 
and s.d. of angular offsets between PVA position and pattern position in all flies. 
k, Correlation between PVA estimate and actual orientation of original left 
stripe for all flies. l, Mean and s.d. of angular offsets between PVA position and 
pattern position for each trial for all flies. m, Mean and s.d. of s.d. of angular 
offset between PVA position and pattern position. 

Extended Data Figure 5 | Example of EBw.s activity bump transitioning 
between locking to one of two identical visual cues placed symmetrically on 
LED arena. a, Sample frames from a calcium imaging time series showing 
single bump of EBw.s activity as the two-stripe pattern moved around the arena 
in a trial in which correlation between EBw.s activity and PVA estimate changes 
over a 4-s period (Fly 6 in Extended Data Fig. 4b). Frames during jump 
indicated by red time stamps. Scale bar, 20 μm. b, EBw.s fluorescence transients 
during trial displayed in a. c, Decoding of fly’s angular orientation using 
unwrapped PVA of EBw.s activity plotted against the fly’s unwrapped 
orientation with respect to stripe 1 and stripe 2 in the visual scene with two 
stripes. Red box corresponds to period when the EB activity bump switches 
from locking to one stripe to locking to the other (identical) stripe. 

Extended Data Figure 6 | Competing influences of visual cue and self- 
motion on EBw.s activity. a, Fluorescence transients during cue shift trial (Fly 
9 from Fig. 1j). Red box highlights epochs during which cue abruptly shifted to 
new position. b, Comparison of PVA estimate versus actual orientation. 
c, Correlations between PVA estimates and actual orientation relative to visual 
cue across trials and flies for different closed-loop gain values. d, Fluorescence 
transients in the EB during closed-loop trial with a low gain of 0.58 (Fly 6 in 
Fig. 1j-m). Superimposed brown line indicates PVA estimate of orientation. 
e, Decoding of fly’s angular orientation using PVA of EBw.s activity plotted 
along with the pattern position and the fly’s walking rotation. PVA closely 
matches walking rotation rather than visual cue rotation. Note that walking 
rotation exceeds visual cue angular rotation in this low gain trial. f, Comparison 
of PVA estimate versus accumulated rotation of visual cue and accumulated 
walking rotation on the ball shows PVA estimate more closely matches walking 
rotation than visual cue rotation. 

Extended Data Figure 7 | EBw.s activity when flies walk in darkness on balls 
of two different diameters. a, Mean and s.d. of FWHM of bump when walking 
in darkness on 6-mm ball. Distribution of bump widths is significantly different 
from that for single-stripe stimulus (Fig. 1k); P = 8 x 10° (see Methods), 
mean width = 90.9° ± 11.2° for walking in darkness versus 82.3° ± 11.5° for 
single stripe. b, Correlations between accumulated PVA and walking rotation 
in the dark across flies for walking on 6-mm diameter ball. c, Mean and s.d. of 
the number of activity bumps as measured by Method 2 (see Methods) during 
all trials (6-mm ball). d, Mean and s.d. of the number of successive calcium 
imaging frames with more than one bump, measured using Method 1 (see 
Methods, 6-mm ball). e, Same as d, but computed using Method 2 (6-mm ball). 
f, Gain between accumulated PVA estimates of orientation and accumulated 
walking rotation across flies for 6-mm ball. g, Sliding window correlations (200 
frames with a step size of 25 frames) between accumulated PVA estimate and 
accumulated walking rotation for different levels of s.d. of walking rotation for 
6-mm ball (s.d. cutoff shown included 97% of epochs). Brown line connects 
highest-frequency bins. h, Correlations between accumulated PVA and 
walking rotation across flies when walking in the dark on 10-mm diameter ball. 
i, Same as f for 10-mm ball. j, Same as g for 10-mm ball. 

Extended Data Figure 8 | Low rotational velocities during walking in 
darkness are not well captured by EBw.s activity. Comparison of angular 
velocity against PVA-estimated angular velocity for all flies walking in darkness 
on 6-mm ball (Fig. 4, see Methods). Each point is computed across a 20-frame 
window, and coloured based on the strength of the PVA during that epoch. 
Three features are prominent: (1) rotational velocity and PVA-estimated 
angular velocity are correlated, but with some spread and with different slopes 
for different flies, that is, effective walking-rotation-to-PVA gains can be 
different for different flies (see Extended Data Fig. 7f, i). (2) Low rotational 
velocities are not always well captured by EB activity which can drift under such 
conditions (see points near 0 of the y axis). (3) Most cases of EB activity drift 
seem to occur in phases when the PVA strength is low (as marked by dark blue 
points arranged in a horizontal line for low velocities). 

Extended Data Figure 9 | Gain and correlation coefficients for flies walking 
with a bright stripe and after the stripe has disappeared. a, Distribution of 
gains between accumulated walking rotation and accumulated PVA estimate 
for flies walking in the dark before exposure to visual stimulus in closed-loop 
experiment (mean = 0.47 ± 1.2, n = 397 walking bouts). b, Distribution of 
gains between accumulated walking rotation and PVA estimate of flies walking 
with a bright stripe with high (light red, mean = 0.86 ± 0.64, n = 147 walking 
bouts) or low (light blue, mean = 0.54 ± 0.5, n = 132) closed-loop gain. All 
gains used were close to the likely ‘natural’ gain. c, Distribution of gains between 
accumulated walking rotation and PVA estimate of flies walking in darkness 
after walking with a stripe under closed-loop control in high (red, mean = 0.57 
± 0.84, n = 150) or low (blue, mean = 0.46 ± 0.7, n = 134) gain conditions. 
d, Distribution of correlation coefficients between accumulated walking 
rotation and accumulated PVA estimate for flies walking in darkness before 
visual experience in the closed-loop setup (mean = 0.6 ± 0.42). e, Distribution 
of correlation coefficients between accumulated walking rotation and 
accumulated PVA estimate for flies walking with a stripe under closed-loop 
control with high (light red, mean = 0.79 ± 0.34) or low (light blue, mean = 
0.85 ± 0.18) closed-loop gain. f, Distribution of correlation coefficients 
between accumulated walking rotation and accumulated PVA estimate for flies 
walking in darkness after walking with a stripe under closed-loop control with 
high (red, mean = 0.48 ± 0.43) or low (blue, mean = 0.49 ± 0.49) gain. P values 
(Kolmogorov-Smirnov two-sample test) for tests of the null hypothesis that the 
correlations from two different conditions are drawn from the same 
distribution are as follows. The null hypothesis can be rejected at P < 0.05 for: 
gain_DarkAfterHighGain vs gain_DarkAfterLowGain: P = 0.04; gain_DarkNaive vs 
gain_DarkAfterHighGain: P = 0.01; gain_StripeHighGain vs gain_StripeLowGain: P = 4 × 
10 *; gain_StripeHighGain vs gain_DarkAfterHighGain: P = 3 × 10” 7; gain_StripeLowGain vs 
gain_DarkAfterLowGain: P = 0.05; gain_StripeLowGain vs gain_DarkNaive: P = 0.001; 
gain_StripeHighGain vs gain_DarkNaive: P = 1 × 10 '°. It cannot be rejected for: 
gain_DarkNaive vs gain_DarkAfterLowGain: P = 0.2. Subscripts indicate conditions of 
the relevant experiments. DarkNaive: in darkness without previous exposure to 
closed-loop visual stimulus; DarkAfterLowGain: walking in darkness after a 
period of walking in closed loop with a single-stripe stimulus under low closed-loop 
gain conditions; DarkAfterHighGain: walking in darkness after a period of 
walking in closed loop with a single-stripe stimulus under high closed-loop gain 
conditions; StripeHighGain: walking with a single stripe under high closed- 
loop gain; StripeLowGain: walking with a single stripe under low closed-loop 
gain. 

Extended Data Figure 10 | Maintenance of EB representation of orientation 
with persistent activity when the fly is standing. a, PVA estimate before stop 
compared to PVA estimate before restart for the 6-mm ball (r = 0.5, P = 1 × 
10 *°, n = 449, linear fit slope = 0.96 ± 0.02, P = 0, intercept: 0.2 ± 0.06, P = 
0.0006, R² = 0.83). b, Difference in PVA before stop and before restart plotted 
against duration over which the fly was standing (mean standing time, tmean = 
6.6 ± 5.1 s, mean PVA difference, ΔPVAmean = 0.09 ± 1). c, Same as a for the 
10-mm ball (r = 0.56, P = 1 × 10 *1, n = 374, intercept = 0.1 ± 0.06, P = 0.09, 
slope = 0.97 ± 0.016, P = 0, n = 374, R² = 0.903). d, Same as b for the 10-mm 
ball (tmean = 6.2 ± 4.5 s, ΔPVAmean = 0.03 ± 0.8). e, PVA estimate before stop 
compared to PVA estimate at restart for the 10-mm ball (r = 0.48, P = 1 × 
10 **, n = 374, slope = 0.96 ± 0.02, P = 0, intercept = 0.13 ± 0.06, P = 0.02, 
R² = 0.91). f, Difference in PVA estimate before stop and at restart for the 10- 
mm ball and duration over which the fly was standing (tmean = 6.1 ± 4.47 s, 
ΔPVAmean = 0.04 ± 0.9). g, PVA estimate before stop compared to PVA 
estimate before restart during closed-loop behaviour with a single stripe (r = 
0.64, P = 1.5 × 10 “9, n = 388, intercept = 0.03 ± 0.07, P = 0.6, slope = 1 ± 
0.02, P = 0, R² = 0.85). h, Difference in PVA before stop and before restart in 
single stripe closed-loop trial plotted against duration for which the fly was not 
walking (tmean = 4.85 ± 3.0 s, ΔPVAmean = 0.04 ± 0.74). i, PVA estimate 
before stop compared to PVA estimate at restart during closed-loop behaviour 
with a single stripe (r = 0.67, P = 5 × 10, n = 388, intercept = 0.1 ± 0.06, P 
= 0.1, slope = 0.97 ± 0.02, P = 0, R² = 0.88). j, Difference in PVA estimate 
before stop and at restart during closed-loop behaviour with a single stripe 
(tmean = 4.97 ± 3.0 s, ΔPVAmean = 0.02 ± 0.65). k-n, Same as g-j for closed- 
loop walking with the pattern with multiple features. g, r = 0.66, P = 2 × 10 '°, 
n = 146, intercept = 0.2 ± 0.1, P = 0.05, slope = 0.9 ± 0.03, P = 0, R² = 0.85. 
h, r = 0.6, P = 1.6 × 10 4, n = 146, intercept = 0.19 ± 0.11, P = 0.07, slope = 
0.91 ± 0.03, P = 2.1 × 10-4, R² = 0.87. i, tmean = 6.3 ± 7.4 s, ΔPVAmean = 
0.1 ± 0.8. j, tmean = 6.4 ± 7.4 s, ΔPVAmean = −0.04 ± 0.8. o-r, Same as 
g-j for closed-loop walking with two stripes. o, r = 0.6, P = 5.1 10°°, n = 
139, intercept = 0.19 ± 0.11, P = 0.08, slope = 0.93 ± 0.03, P = 0, R² = 0.88. 
p, r = 0.7, P = 14x 10°71, n = 139, intercept = 0.2 ± 0.1, P = 0.03, slope = 
0.95 ± 0.03, P = 0, R² = 0.9. q, tmean = 5.6 ± 5.8 s, ΔPVAmean = 0.01 ± 0.7. 
r, tmean = 5.8 ± 5.8 s, ΔPVAmean = 0.1 ± 0.66. 

Strangulation as the primary mechanism for shutting 
down star formation in galaxies 


Y. Peng, R. Maiolino & R. Cochrane 


Local galaxies are broadly divided into two main classes, star-form- 
ing (gas-rich) and quiescent (passive and gas-poor). The primary 
mechanism responsible for quenching star formation in galaxies 
and transforming them into quiescent and passive systems is still 
unclear. Sudden removal of gas through outflows’ or stripping’? 
is one of the mechanisms often proposed. An alternative mech- 
anism is so-called “strangulation”, in which the supply of cold 
gas to the galaxy is halted. Here we report an analysis of the stellar 
metallicity (the fraction of elements heavier than helium in stellar 
atmospheres) in local galaxies, from 26,000 spectra, that clearly 
reveals that strangulation is the primary mechanism responsible 
for quenching star formation, with a typical timescale of four 
billion years, at least for local galaxies with a stellar mass less than 
10¹¹ solar masses. This result is further supported independently 
by the stellar age difference between quiescent and star-forming 
galaxies, which indicates that quiescent galaxies of less than 10¹¹ 
solar masses are on average observed four billion years after 
quenching due to strangulation. 

Figure 1 qualitatively illustrates the expected evolution of galaxies 
in the two quenching scenarios. A more quantitative analysis is 
provided below. In the scheme of Fig. 1, at t < tq the galaxy is 
subject to gas inflow and forms stars, thus increasing the stellar mass 
and the metallicity of the gas, out of which new stars form; hence the 
metallicity of the newly formed stars also increases with time. The 
metallicity increment is modest, since inflowing gas dilutes the metal 
content in the interstellar medium. At t = tq quenching occurs. In 
the scenario of sudden gas removal (for example, expulsion of gas by 
a strong wind or strong ram pressure stripping) star formation is 
suddenly quenched (Fig. 1a), and the galaxy evolves into a quiescent 
system. In this case the stellar metallicity and stellar mass (Mstar) of 
the quiescent galaxy are the same as those of its star-forming pro- 
genitor just before quenching. In the case of quenching by stran- 
gulation, star formation can continue, using the gas available in the 
galaxy until it is completely used up (Fig. 1b). During this phase the 
gas metallicity increases more steeply than in the previous case 
because of the lack of dilution from inflowing gas. The stellar mass 
also increases slightly. The general product of strangulation is a 
quiescent galaxy with a stellar metallicity that is much higher than 
its star-forming progenitor, and a slightly higher stellar mass. 

Observationally, we obviously cannot follow the evolution of the 
stellar metallicity of individual galaxies. However, we can statistically 
investigate the metallicity difference between star-forming and qui- 
escent galaxies. This statistical approach has already been successfully 
exploited, for instance, to investigate the dependence of the quenching 
mechanism on mass and environment’’. We have used the Sloan 
Digital Sky Survey (SDSS) spectra of local (redshift z << ~0.1) galaxies 
to extract a subsample of 3,905 star-forming and 22,618 quiescent 
galaxies whose spectra have a signal to noise ratio of S/N > 20 per 
spectral pixel, which ensures a reliable determination of the stellar 
metallicities (details of the sample selection and determination of the 
stellar metallicities are given in the Methods). 


Figure 2a shows the average stellar metallicity of star-forming (blue 
line) and quiescent (red line) galaxies as a function of stellar mass, 
obtained by using a sliding average of 0.2 dex in Mstar (error bars give 
the 1σ uncertainty of the mean stellar metallicity). At a given stellar 
mass, the stellar metallicity of quiescent galaxies is noticeably higher 
than for star-forming galaxies, at least for Mstar < 10^11 M⊙, where M⊙ 
is the solar mass. This is not what is expected in the case of sudden gas 
removal, but it is qualitatively consistent with the strangulation scen- 
ario. Below we investigate more quantitatively the agreement of the 
data with the strangulation scenario. 
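
The sliding average used for Fig. 2a can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the function name `sliding_mean`, the array names, and the effective-sample-size estimate of the 1σ error on a weighted mean are all hypothetical choices, and the per-galaxy weights simply stand in for the 1/TSR × 1/Vmax weights described in the Methods.

```python
import numpy as np

def sliding_mean(log_mass, values, weights, centers, width=0.2):
    """Weighted mean and its standard error in windows of `width` dex.

    log_mass : log10 of stellar mass for each galaxy
    values   : quantity to average (for example, log stellar metallicity)
    weights  : per-galaxy weights (assumed here to be 1/TSR * 1/Vmax)
    centers  : window centres in log10 stellar mass
    """
    means, errors = [], []
    for c in centers:
        sel = np.abs(log_mass - c) <= width / 2
        if sel.sum() < 2:
            # too few galaxies in this window to define a mean and an error
            means.append(np.nan)
            errors.append(np.nan)
            continue
        w = weights[sel] / weights[sel].sum()
        mean = np.sum(w * values[sel])
        # weighted variance, divided by an effective number of galaxies
        var = np.sum(w * (values[sel] - mean) ** 2)
        n_eff = 1.0 / np.sum(w ** 2)
        means.append(mean)
        errors.append(np.sqrt(var / n_eff))
    return np.array(means), np.array(errors)
```

Calling the function once for the star-forming sample and once for the quiescent sample, on the same grid of window centres, would give the two curves whose difference is compared with the model.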

For a system forming stars without any inflow, the temporal 
evolution of the gaseous and stellar metallicity, as well as of the stellar 
mass, can be trivially solved analytically, as discussed in the Methods. 
The key parameters are: (1) the total gas mass at the time of quenching, 
Mgas(tq) or, equivalently, the gas fraction fgas(tq); (2) the global 
efficiency of star formation ε, defined by the star-formation rate 
(SFR) = ε Mgas (where Mgas is in this case the total gas mass, both 
atomic and molecular); (3) the amount of any outflowing gas that is 
lost (that is, which does not fall back onto the galaxy and is not 
recycled), which can be approximated as being proportional to the 
SFR and parameterized through the so-called outflow mass-loading 
factor λ, defined as Moutflow = λ SFR (note that if part of the gas falls 
back and is recycled, then λ would account only for the fraction 
of gas that is lost, that is, it is an ‘effective’ outflow loading factor). 

We are dealing with differential quantities (in particular, differential 
stellar metallicities ΔZstar), so the metallicity of the gas and stars at the 
beginning of the quenching (t = tq) is unimportant. Yet, the value of 
Mstar at t = tq is relevant because it defines the gas fraction, and hence 
the gas mass available for further star formation. Indeed, it is 
observationally well known that the gas fraction in star-forming galaxies 
decreases with Mstar (refs 18-21; see Methods for the detailed 
functional form). The global star-formation efficiency ε is also known from 
observations (see Methods). 

Figure 2b shows the stellar metallicity difference, as a function of 
stellar mass (solid thin lines in colours), as expected from the 
strangulation scenario, at five different times after the 
quenching/strangulation time (Δt = t − tq). In this plot we are assuming that the effective 
outflow loading factor is λ = 0 (that is, any outflowing gas falls back 
and is recycled), implying that the system behaves as a closed box after 
strangulation (no inflow and no effective outflow). The case of 
substantial outflow after strangulation is discussed below. The decline of 
the curves at high Mstar is primarily a consequence of the gas fraction of 
star-forming galaxies decreasing as a function of Mstar: massive 
galaxies have low gas content, and so once such a galaxy has been 
strangled, its available gas can produce few stars relative to those 
already present, and the average stellar metallicity is not much 
affected. The thick black line with error bars shows the observed stellar 
metallicity difference between quiescent and star-forming galaxies 
(that is, the difference between red and blue data in Fig. 2a). The 
observed difference is consistent, within uncertainties, with the 
strangulation scenario in which quiescent galaxies at Mstar < 10^11 M⊙ are, 
on average, observed 4 billion years (Gyr) after quenching due to 
strangulation. 

Figure 1 | Illustration of two different quenching scenarios and their effect 
on stellar metallicities. a, Rapid and complete removal of the gas reservoir 
of the galaxy (for example, from strong outflows or ram pressure stripping) 
results in a passive galaxy with the same mass and same stellar metallicity as 
its star-forming progenitor. b, In the strangulation scenario, the galaxy can 
continue to form stars from the available enriched interstellar medium and, 
as a consequence, increase its stellar mass and stellar metallicity. 

This result can be tested independently by looking at the stellar age 
difference between quiescent and star-forming galaxies, which can be 
measured from the galaxy spectra, independently of the stellar 
metallicities (see Methods). Figure 3a shows the average age of all galaxies 
(black line), as a function of stellar mass. The average age increases 
steeply with Mstar, but this is mostly because the fraction of quiescent, 
old galaxies, relative to star-forming young galaxies, increases with 
stellar mass. If the population of galaxies is split into quiescent and 
star-forming samples, then the average ages of the two samples show a 
much flatter dependence on Mstar, as illustrated by the red (quiescent) 
and blue (star-forming) lines in Fig. 3a. Most importantly, as 
illustrated in Fig. 3b, the age difference between the two populations is 
reasonably constant and equal to about 4 Gyr. The latter is the same 
age difference required to explain the stellar metallicity difference in 
the strangulation scenario; it therefore further and independently 
confirms this scenario. The relatively long timescale of about 4 Gyr is also 
supported by independent observations and simulations. 

Figure 2 | Stellar metallicities for star-forming and quiescent galaxies. 
a, Average stellar metallicity as a function of stellar mass for all star-forming 
galaxies (thick blue line with error bars) and all quiescent galaxies (thick red 
line with error bars) for galaxies at <z> ~ 0.05. Error bars correspond to 
the 1σ error on the mean value. b, Average metallicity difference between all 
star-forming and all quiescent galaxies (thick black line with error bars). 
Error bars on the black line indicate the 1σ uncertainty in the metallicity 
difference. The metallicity difference decreases with increasing stellar mass. 
It reaches the maximum value around 0.4 dex for galaxies at Mstar ~ 10^9.5 M⊙ 
and becomes negligible at Mstar ≳ 10^11 M⊙. The coloured lines show the 
metallicity difference predicted by a simple close-box model at different times 
Δt after strangulation. Solid lines are for the final mass (at time t = tq + Δt), 
while dashed lines are for the mass at strangulation (t = tq). The observed 
mass-dependent metallicity difference between quiescent and star-forming 
galaxies (thick black line) can be very well reproduced by a close-box model 
with a constant Δt ~ 4 Gyr, largely independent of stellar mass. 

We also investigate the case of a substantial ‘effective’ outflow, after 
strangulation, by setting λ = 1, which is a typical loading factor 
observed in star-forming galaxies. Figure 4 shows the resulting 
ΔZstar curves, which are completely inconsistent with the observed 
data. This further confirms that gas removal by outflows plays a minor 
part in quenching galaxies. This includes any external environmental 
effect such as gas removal in satellite galaxies when falling into a more 
massive halo, or feedback process such as outflow driven by an active 
galactic nucleus. 

Overall, our results strongly support the scenario in which local 
quiescent galaxies with Mstar < 10^11 M⊙ (that is, the vast majority of 
galaxies) are primarily quenched as a consequence of strangulation. 
However, this analysis does not clarify what the strangulation 
mechanism is (for example, hot halo environmental strangulation or 
strangulation via various preventive feedback mechanisms, such as 
circumgalactic gas heating). Additional analysis is needed to 
investigate the strangulation mechanism (for example, by studying the 
central/satellite and environmental dependence; see Methods subsection 
‘Stellar metallicity for central and satellite galaxies’). We also note that 
our results do not imply any claim about the morphological changes 
of the galaxy population, as it is not completely clear whether the 
morphological transformation is associated with star-formation 
quenching. 

Figure 3 | Stellar ages for star-forming and quiescent galaxies. 
a, Luminosity-weighted stellar age as a function of stellar mass for star-forming 
(blue line), quiescent (red line) and all galaxies (black line) at <z> ~ 0.05. 
Error bars are the 1σ error on the mean value. The average age for all galaxies 
(black line) strongly depends on the stellar mass, largely due to the fact that 
the red fraction (that is, the mixture of quiescent and star-forming galaxies) 
strongly depends on stellar mass. However, the dependence on stellar 
mass becomes much weaker once the whole sample is split into star-forming 
and quiescent galaxies. b, Average age difference between quiescent and 
star-forming galaxies as a function of stellar mass. Error bars on the black 
line indicate the 1σ uncertainty in the age difference. Remarkably, the 
age difference for all galaxies is largely independent of mass, with a mean value 
of around 4 Gyr, which is consistent with the mass-independent time Δt 
from strangulation required to explain the difference in stellar metallicities. 

The data presented in this paper cannot shed light on the quenching 
mechanism at Mstar ≳ 10^11 M⊙. At Mstar ~ 10^11 M⊙ the stellar 
metallicity of quenched galaxies is similar to the stellar metallicity of 
star-forming galaxies (Fig. 2), which can be interpreted equally well as 
quenching by sudden gas removal (such as by outflows; Fig. 1a) or 
as quenching by strangulation of gas-poor massive galaxies (indeed, in 
massive galaxies the small amount of available gas, as shown in 
Extended Data Fig. 1a, does not allow much star formation, and hence 
there is little variation of the stellar metallicity, even if the galaxy is 
strangled; see also discussions in Methods subsection ‘Fraction of 
galaxies quenched by rapid gas removal’). At even higher stellar masses 
our analysis is not feasible, since star-forming galaxies with Mstar > 
10^11 M⊙ are extremely rare in the local Universe, hence preventing our 
statistical approach. To shed light on the quenching mechanism of 
massive galaxies a similar analysis has to be performed at high z, where 
massive star-forming galaxies are abundant and gas-rich. 

The results obtained in this paper apply on average to the bulk of 
the local galaxy population. However, for individual galaxies, other 
quenching mechanisms such as fast gas removal via outflows (which 
can also help to explain the α-element enhancement in massive 
elliptical galaxies) and environmental effects (such as ram-pressure 
stripping, tidal stripping, harassment and mergers), may work together 
with, or cause, strangulation to shape the detailed quenching process 
(see Methods subsection ‘Fraction of galaxies quenched by rapid gas 
removal’). These additional quenching mechanisms may modify the 
amount of stellar metallicity enhancement, the quenching timescale, 
and/or the gas content, which may all contribute to the scatter in the 
stellar metallicity and age differences between star-forming and 
quiescent galaxies. 

Figure 4 | The effect of outflows on stellar metallicity evolution. As for 
Fig. 2b, but for the case of an effective mass-loading factor λ = 1 after 
strangulation. In this case, the observed stellar metallicity difference (black 
thick line) is clearly not reproduced by the model, further suggesting that 
outflows do not play a major part. Error bars on the black line indicate the 1σ 
uncertainty in the metallicity difference. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS 


Sample and observational data. The parent sample of galaxies analysed in this 
paper is the SDSS DR7 sample that is used in refs 15 and 31 for similar statistical 
investigations of the quenching process. Briefly, it is a magnitude-selected sample 
of galaxies in the redshift range of 0.02 < z < 0.085 that have clean photometry and 
Petrosian SDSS r-band magnitudes in the range 10.0 < r < 18.0 after correcting for 
Galactic extinction. The parent photometric sample contains 1,579,314 objects 
after removing duplicates, of which 238,474 have reliable spectroscopic redshift 
measurements. As a consequence of the relatively broad redshift range 0.02 < z 
< 0.085, the projected physical aperture of the SDSS spectroscopic fibre changes 
substantially across the sample. To test whether there are any noticeable aperture 
effects, we further split the whole redshift range into two narrower redshift ranges, 
at 0.02 < z < 0.05 and 0.05 < z < 0.085. The results are shown in the Extended 
Data Fig. 2. It is clear that the results change little as a function of redshift. 

Each galaxy is weighted by 1/TSR × 1/Vmax, where TSR is a spatial target 
sampling rate, determined using the fraction of objects that have spectra in the 
parent photometric sample within the minimum SDSS fibre spacing of 55 arcsec of 
a given object. The Vmax values are derived from the k-correction program v4_1_4 
(ref. 32). The use of Vmax weighting allows us safely to include representatives of 
the galaxy population down to a stellar mass of about 10^9 M⊙. 

The stellar masses are determined from the k-correction code using population 
synthesis models and a Chabrier initial mass function. The galaxy population is 
then divided into star-forming and passive galaxies, based on their spectroscopic 
emission line classifications and rest-frame (U − B) colours. Star-forming galaxies 
are defined to have the classification flag (that is, the “iclass” keyword), in the SFR 
catalogue, set to 1. This also excludes those objects hosting an active galactic 
nucleus, for which the derived metallicities are probably not reliable. Passive galaxies 
are defined to have their (U − B) colours redder than a threshold colour that is given 
by equation (2) in ref. 15 and without Hα emission (that is, undetected with S/N < 3). 

The stellar metallicities and r-band weighted stellar ages were derived by ref. 35 
from the spectral absorption features of SDSS Data Release 4 spectra, which are 
then cross-matched with our parent galaxy sample. We refer to ref. 35 for a 
detailed description of the method. Here, we mention only that the method 
consists of measuring the strength of a set of carefully selected spectral absorption 
features, including several well calibrated Lick indices and the 4,000 Å break. The 
authors of ref. 35 then compute the median-likelihood estimates of the stellar 
metallicities and r-band light-weighted ages by comparing the spectral absorption 
features to a large library of 150,000 Monte Carlo realizations spanning a full range of 
physically plausible star-formation histories. The stellar metallicities are derived for 
both passive and star-forming galaxies, for which the contamination of stellar 
absorption features by nebular emission has been carefully removed. To ensure 
a reliable metallicity measurement, we restrict our galaxy sample to galaxies that 
have a median S/N per pixel of at least 20 over the whole spectrum. This ensures 
the average uncertainty of the stellar metallicity and light-weighted age to be less 
than ±0.15 dex. The requirement on the S/N could, in principle, be lowered to 
increase the statistics, but this would result in higher uncertainties in the metalli- 
city and age measurements. Since the statistical errors on the mean metallicity and 
mean age are already reasonably small with the above S/N selection, the analysis 
benefits more from solid measurements than from improved statistics. 

With the selection criteria given above, the final sample consists of 22,618 
passive galaxies and 3,905 star-forming galaxies. All of these galaxies have reliable 
stellar mass, stellar metallicities and age measurements. 

Metallicity evolution during quenching via strangulation. The quenching pro- 
cess via strangulation can be quantitatively described by using the analytical frame- 
work discussed in refs 36 and 37. These simple analytical models take into account 
the key physical processes of inflow, star formation, outflow and metal production. 

We assume the star-formation law holds in the same way before and during the 
quenching process, that is, the instantaneous average SFR of the galaxy is always 
related to the gas mass present within the galaxy as: 


$$\mathrm{SFR} = \varepsilon M_{\mathrm{gas}} \qquad (1)$$


where Mgas is the total gas mass (both atomic and molecular). In fact, equation (1) 
can be regarded as the definition of the effective, global star-formation efficiency ε or, 
more properly, the inverse of the total gas depletion timescale τdep = Mgas/SFR = 
1/ε. Note that, from equation (1), the specific SFR can be expressed as: specific 
SFR = SFR/Mstar = ε Mgas/Mstar. 

The mass-loss rate of the galaxy Ψ, that is, the outflow rate, is very likely to be 
closely related to the average SFR of the galaxy. Analogously to equation (1), we 
link these two quantities together via λ, to give: 

$$\Psi = \lambda\,\mathrm{SFR} \qquad (2)$$

where λ is the mass-loading factor. Similar to ε, equation (2) can be regarded as the 
definition of λ. It should be noted that strangulating the gas inflow does not 
necessarily turn the gas regulator model to a simple close-box model, since the 
galaxy can continue to have an outflow. 

The general evolution of the gas metallicity Zgas, without assuming any 
equilibrium condition, is given by equation (32) in ref. 37, that is: 

$$\frac{dZ_{\mathrm{gas}}}{dt} = y\varepsilon - (Z_{\mathrm{gas}} - Z_0)\,\frac{\Phi}{M_{\mathrm{gas}}} \qquad (3)$$

where y is the average yield per stellar generation and is assumed to be a constant, 
Φ is the inflow rate, Z0 is the metallicity of the infalling gas and Mgas is the gas mass. 
When quenching via strangulation starts, Φ is set to zero and equation (3) can be 
easily solved: 


$$Z_{\mathrm{gas}}(t) = Z_{\mathrm{gas}}(t_q) + y\varepsilon t \qquad (4)$$


where Zgas(tq) is the gas metallicity at the time when quenching begins. It is clear 
from equation (4) that ΔZgas = yεt, that is, for a constant yield and star-formation 
efficiency, the gas metallicity increase is simply proportional to time. We note that 
when the inflow is truncated, the gas metallicity is independent of the outflow. 
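
The statement that the gas metallicity grows linearly and does not depend on the outflow can be checked numerically. The sketch below is an illustration under explicit assumptions, not the authors' calculation: it integrates a simple gas-mass and metal-mass budget with no inflow (metals produced at a rate y × SFR, and removed at the current gas metallicity both by long-lived stars and by the outflow), which is one bookkeeping that reproduces equation (3) when the inflow is zero. The function name and the values of y, R and the initial metallicity are hypothetical, and t is taken to be the time elapsed since quenching began.

```python
def gas_metallicity_after(dt_gyr, eps, lam, y=0.02, R=0.4,
                          m_gas0=1.0, z0=0.01, n_steps=20_000):
    """Euler-integrate gas and metal masses with no inflow; return final Z_gas.

    eps : star-formation efficiency in 1/Gyr, so SFR = eps * M_gas
    lam : mass-loading factor, so outflow rate = lam * SFR
    y, R, m_gas0, z0 : illustrative values only (assumptions)
    """
    h = dt_gyr / n_steps
    m_gas = m_gas0
    m_z = z0 * m_gas0          # mass of metals in the gas
    for _ in range(n_steps):
        sfr = eps * m_gas
        z = m_z / m_gas
        # metals: produced by stars, locked into long-lived stars,
        # and carried away by the outflow at the current gas metallicity
        dm_z = (y - z * (1.0 - R + lam)) * sfr
        # gas: consumed by net star formation and by the outflow
        dm_gas = -(1.0 - R + lam) * sfr
        m_z += h * dm_z
        m_gas += h * dm_gas
    return m_z / m_gas


z_closed = gas_metallicity_after(4.0, eps=0.5, lam=0.0)
z_outflow = gas_metallicity_after(4.0, eps=0.5, lam=1.0)
print(z_closed, z_outflow, 0.01 + 0.02 * 0.5 * 4.0)
```

Within the integration error, both runs end at the initial metallicity plus yεt, so the outflow changes how much gas is left but not its metallicity.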

From equation (4), the logarithmic increase of the gas metallicity when 
quenching begins is given by: 

$$\log Z_{\mathrm{gas}}(t) - \log Z_{\mathrm{gas}}(t_q) = \log\left[1 + \frac{y\varepsilon t}{Z_{\mathrm{gas}}(t_q)}\right] \qquad (5)$$

Before the start of the quenching, according to equation (35) in ref. 37, Zgas is 
proportional to the yield y if Z0 ~ 0. By inserting it into the right-hand side of 
equation (5), y will cancel out from both the numerator and denominator. 
Therefore, when the inflow is truncated, the amount of logarithmic increase of 
the gas metallicity is independent of the yield. Hence, any uncertainty on the yield 
is completely irrelevant to our analysis. 

The general evolution of the stellar metallicity Zstar, without assuming any 
equilibrium condition, is given by equation (40) in ref. 37 as: 

$$\frac{dZ_{\mathrm{star}}}{dt} = \mathrm{sSFR} \times (1-R)(Z_{\mathrm{gas}} - Z_{\mathrm{star}}) = \frac{\varepsilon M_{\mathrm{gas}}}{M_{\mathrm{star}}}\,(1-R)(Z_{\mathrm{gas}} - Z_{\mathrm{star}}) \qquad (6)$$

where R is the fraction of the mass of the newly formed stars that is quickly 
returned to the interstellar medium through stellar winds and supernovae. It is 
clear from equation (6) that the stellar metallicity simply evolves towards the gas 
metallicity on a timescale controlled by (specific SFR)^-1. 

The change of stellar mass of the galaxy per unit time is given by: 

$$\frac{dM_{\mathrm{star}}}{dt} = (1-R)\,\mathrm{SFR} \qquad (7)$$

where (1 − R)SFR is the net SFR that contributes to the net stellar mass increase of 
the galaxy, that is, the fraction of newly produced stars in the form of long-lived 
stars. The change of the gas mass of the galaxy per unit time is given by: 

$$\frac{dM_{\mathrm{gas}}}{dt} = -(1-R) \times \mathrm{SFR} - \Psi = -(1-R+\lambda)\,\varepsilon M_{\mathrm{gas}} \qquad (8)$$


To calculate the change of the stellar metallicity during quenching, we need to 
know, at a given stellar mass, the gas mass (or equivalently the gas fraction), the 
star-formation efficiency ε and the mass-loading factor λ. 

For λ, we first assumed that, during the strangulation process, the galaxy can 
recycle all the outflow gas by setting λ = 0, that is, we assumed a close-box model. 
Then, as discussed in the text, we also investigated the case of λ = 1 during 
strangulation. 
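
A minimal sketch of how such a model can be integrated is given below. It is not the authors' implementation: it steps the relations above forward in time with a simple Euler scheme, using the star-formation law of equation (1), the linear gas enrichment of equation (4), the stellar metallicity evolution of equation (6), and the stellar-mass and gas-mass rate equations. The function name and all numerical inputs (gas fraction, ε, R, y and the initial metallicities) are hypothetical illustrative values, whereas the paper takes the gas fraction and ε as functions of stellar mass from observations. The gas fraction is assumed here to mean Mgas/(Mgas + Mstar), and t is measured from the start of strangulation.

```python
import math

def strangulation_track(dt_gyr, f_gas, eps, lam=0.0, R=0.4, y=0.02,
                        z_gas0=0.01, z_star0=0.006, n_steps=20_000):
    """Evolve a strangled galaxy for dt_gyr after inflow stops.

    Masses are in units of the stellar mass at the start of strangulation.
    Returns (final stellar mass, final stellar metallicity,
             increase in log10 stellar metallicity).
    All default parameter values are illustrative assumptions.
    """
    m_star = 1.0
    m_gas = f_gas / (1.0 - f_gas)    # assumes f_gas = M_gas / (M_gas + M_star)
    z_star = z_star0
    h = dt_gyr / n_steps
    for i in range(n_steps):
        t = i * h
        z_gas = z_gas0 + y * eps * t                 # equation (4)
        sfr = eps * m_gas                            # equation (1)
        dz_star = sfr / m_star * (1.0 - R) * (z_gas - z_star)   # equation (6)
        dm_star = (1.0 - R) * sfr                    # net stellar mass growth
        dm_gas = -(1.0 - R + lam) * sfr              # gas used up or expelled
        z_star += h * dz_star
        m_star += h * dm_star
        m_gas += h * dm_gas
    return m_star, z_star, math.log10(z_star / z_star0)


# Closed box (lam = 0) versus effective outflow (lam = 1), 4 Gyr after strangulation
print(strangulation_track(4.0, f_gas=0.5, eps=0.5, lam=0.0))
print(strangulation_track(4.0, f_gas=0.5, eps=0.5, lam=1.0))
```

With these illustrative inputs, a gas-rich galaxy gains more stellar metallicity than a gas-poor one, and switching on the outflow reduces the gain because less of the enriched gas is turned into stars.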

Both gas fraction and star-formation efficiency ε (or equivalently the gas 
depletion timescale τdep) have been measured observationally. In fact, the predicted 
gas fraction and ε determined using the model in ref. 37 match the latest 
observations in the local Universe extremely well, as shown in Extended Data Fig. 1. 
We stress again that the gas fraction and star-formation efficiency in all our 
calculations are defined with the total gas mass (including both atomic and 
molecular). In some previous work, such as ref. 38, the average gas depletion 
time is found to be constant at about 2 Gyr, but that value refers to the molecular gas 
depletion time (see subsection ‘Effect of constant ε on stellar metallicity evolution’ 
below). 

The change in stellar metallicity as a function of stellar mass at different times Δt 
after strangulation is shown in Fig. 2b (for λ = 0). During strangulation galaxies 
will continue to form stars with the available gas following the star-formation law 
given by equation (1) and their stellar mass will hence continue to grow. The 
coloured solid lines show the stellar metallicity increase as a function of the final 
stellar mass, while the coloured dashed lines show the stellar metallicity increase as 
a function of the stellar mass at the epoch when the strangulation starts. As shown 
in Fig. 2b, the stellar metallicity increase is slightly larger if the stellar mass con- 
sidered is the final stellar mass, but there are no substantial differences between 
these two stellar masses. 

At a given stellar mass, the amount of stellar metallicity increase is nearly 
proportional to the time Δt elapsed from the beginning of strangulation. At a 
given Δt the stellar metallicity increase is larger for low-mass galaxies than for 
more-massive galaxies. This is because the variation of stellar metallicity depends 
on the size of the gas reservoir, that is, the gas fraction, at the epoch when the 
strangulation starts (the larger the gas reservoir, the more metals can be produced 
in the strangulation phase). For massive galaxies, which have a low gas fraction 
(Extended Data Fig. 1a), the relative amount of gas available for star formation is 
small, while the existing stellar population (with relatively low metallicity) is large 
compared to the amount of new stars (with higher metallicity) that will form 
during the strangulation. Therefore, although the star-formation efficiency is 
higher for massive galaxies (Extended Data Fig. 1b), the increase in stellar metal- 
licity is smaller for massive galaxies than for low-mass galaxies (Fig. 2b). 

It is evident, as discussed in the main text, that the observed metallicity differ- 
ence can be very well reproduced by a simple close-box model with a constant 
mass-independent Δt ~ 4 Gyr across the entire observed range of stellar masses, 
which is consistent with the age difference between the two populations (Fig. 3b). 
We stress that a different stellar metallicity calibration and age calibration method 
may give different scales and different slopes of the stellar mass versus metallicity/ 
age relations, but the results illustrated above are preserved regardless of the 
adopted calibration. This is because our results are mainly based on metallicity 
differences and age differences between star-forming and passive galaxies, that is, 
we are dealing with differential quantities, and therefore uncertainties in the 
metallicity/age scale are much less critical than in studies dealing with absolute 
quantities. 

Finally, we discuss one second-order effect that we have not taken into consid- 
eration, but which would further reinforce our results. The star-forming progeni- 
tors of passive (quenched) galaxies observed locally should be star-forming 
galaxies at z ~ 0.5 (that is, 4 Gyr ago), and not the star-forming population 
observed locally in SDSS. Unfortunately, SDSS does not have the sensitivity to 
deliver star-forming galaxies at high redshifts in large numbers and with the same 
S/N as local star-forming galaxies. However, star-forming galaxies at z ~ 0.5 
should have a metallicity even lower than local galaxies. Therefore, if anything, the 
true metallicity difference between passive galaxies and their star-forming 
progenitors should be even larger than that observed in Fig. 2. However, this 
effect is expected to be very small, since the 
mass-metallicity relation evolves very little from z = 0 to z = 0.5 (refs 39 and 40). 
Stellar metallicity for central and satellite galaxies. Since a distinction between 
central and satellite galaxies appears in many theoretical models for the evolution 
of galaxies and may potentially shed light on the strangulation mechanism, we 
further divide the whole sample into central and satellite galaxies and the results 
are shown in the Extended Data Fig. 3. As discussed in ref. 31, there are difficulties 
in the identification of true central galaxies due to over-fragmentation of groups by 
the group-finding algorithm, which would lead to some satellites being misiden- 
tified as central galaxies. This effect is expected to be most severe for low-mass 
galaxies in high-density regions. To obtain a clean sample of true central galaxies, 
we further select the central galaxies in the fields, where their over-densities are 
below the mean over-density of the local Universe. 

The stellar metallicity enhancement of satellites is slightly larger than that of 
central galaxies at Mstar < 10^10 M⊙. This suggests that low-mass satellites are likely 
to suffer more from the strangulation than central galaxies and hence implies an 
environmental origin of strangulation at these low masses (for example, inflow of 
gas being halted as satellite galaxies plunge into the hot halos). At masses Mstar > 
10^10 M⊙ no difference between central galaxies and satellites is detected. This 
suggests that at high masses the strangulation process may operate similarly for 
both central galaxies and satellites. A more detailed analysis, in different environ- 
ments, is required to unveil the origin of strangulation at high masses, which will 
be presented in a future work. 
Fraction of galaxies quenched by rapid gas removal. We argue here that the 
majority of local quiescent galaxies with Mstar < 10^11 M⊙ are primarily quenched as 
a consequence of strangulation. This is a statistical statement, based on the average 
properties of the galaxy population. It is, however, interesting to quantify the 
fraction of galaxies whose data may potentially allow for rapid quenching by 
sudden gas removal (which does not cause a metallicity change) as an alternative 
viable process. 

In each panel of Extended Data Fig. 4, we show the probability density function 
(PDF) of the stellar metallicity of star-forming galaxies (blue line) and that of the 
passive galaxies (red line) at a given stellar mass. Each PDF is normalized (that is, 
the area beneath each curve is unity). The overlapping region of the two PDFs is 
shaded (light red). The fraction of the shaded over the total area given by the blue 
PDF is noted as fmax in the label. If all star-forming galaxies were eventually to be 
quenched, the star-forming PDF (blue line) should eventually evolve into the 
passive PDF (red line). 

The evident difference between star-forming and passive metallicity implies 
that strangulation is the primary quenching process. The opposite is not neces- 
sarily true: a similar metallicity for star-forming and passive galaxies may be 
caused either by sudden gas removal (such as outflows and gas stripping) or 
strangulation of galaxies with modest gas content. Furthermore, we also note that 
if the metallicity of a star-forming galaxy is similar to those of passive ones, this 
could mean that this galaxy initially had large gas content, was strangled, and we 
are observing it during the last stage of star formation in its strangulation phase. 
Therefore the shaded area, and hence fmax, give a very conservative upper limit to 
the fraction of galaxies for which sudden gas removal can potentially be an alterna- 
tive quenching mechanism. 
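
The overlap fraction described above can be sketched as follows. This is a minimal illustration and not the authors' procedure: it assumes the two PDFs are estimated as unit-area histograms on a shared set of bins (the text does not say how the PDFs were constructed), and the function name, the number of bins and the input arrays are hypothetical. Because the star-forming PDF has unit area, the shared area is already the fraction of that PDF.

```python
import numpy as np

def overlap_fraction(z_sf, z_passive, bins=40):
    """Area shared by two unit-area histograms, as a fraction of the first.

    z_sf      : stellar metallicities of star-forming galaxies in one mass bin
    z_passive : stellar metallicities of passive galaxies in the same mass bin
    """
    lo = min(z_sf.min(), z_passive.min())
    hi = max(z_sf.max(), z_passive.max())
    edges = np.linspace(lo, hi, bins + 1)      # common binning for both samples
    pdf_sf, _ = np.histogram(z_sf, bins=edges, density=True)
    pdf_pa, _ = np.histogram(z_passive, bins=edges, density=True)
    width = np.diff(edges)
    # the overlapping (shaded) region is the lower of the two curves in each bin
    return float(np.sum(np.minimum(pdf_sf, pdf_pa) * width))
```

Two identical samples give a fraction of one, and two samples with no metallicity in common give zero; real data fall in between, and the result depends somewhat on the bin width chosen.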

fmax is 50% at low masses, confirming that most of the galaxies at low masses 
must be quenched by strangulation. We recall, as discussed above, that this is a 
very conservative approach. fmax progressively increases with increasing stellar 
mass. This effect is due to the fact that the observed stellar metallicity enhancement 
decreases with increasing stellar mass, as shown in Fig. 2b, in which the lines with 
different Δt converge to zero with increasing mass. This implies that, as already 
discussed, the constraining power of the stellar metallicity data on the quenching 
mechanism decreases with increasing mass. In the most-massive cases of Mstar > 
10^11 M⊙, the stellar metallicity data cannot shed light on the quenching 
mechanism, because the small metallicity difference can be interpreted equally well as 
rapid quenching by sudden gas removal or as quenching by strangulation. 
Effect of constant ε on stellar metallicity evolution. As discussed in the 
subsection ‘Metallicity evolution during quenching via strangulation’, although the 
star-formation efficiency ε defined with the total gas mass (ε = SFR/(MHI + MH2)) is 
very unlikely to be constant, it is useful to test the validity of the result further in 
Fig. 2b against a putative constant ε. We keep all other parameters unchanged in 
the model, except that when strangulation starts we set ε to be constant at 
0.5 Gyr^-1, that is, a constant gas depletion timescale of τdep = 2 Gyr. The results 
are shown in Extended Data Fig. 5. It is clear that at Mstar ≳ 10^10 M⊙, 
the prediction from the model is very similar to Fig. 2b and is still consistent with 
Δt ~ 4 Gyr or 5 Gyr. This is because the average ε in Extended Data Fig. 1b at 
Mstar ~ 10^10 M⊙ is roughly 0.5 Gyr^-1. However, below about 10^10 M⊙ the model 
now predicts a much steeper metallicity increase, owing to the adopted constant 
value of ε = 0.5 Gyr^-1 that is much higher than the one shown in Extended Data 
Fig. 1b. This suggests that, to achieve the same observed metallicity enhancement, a 
higher ε will need a shorter Δt. In other words, the strangulation process can be fast 
(for example, less than 1 Gyr) if ε is large (for instance, at higher redshifts). 
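
As a rough illustration of why a larger ε implies a shorter Δt, one can use the gas-phase relation ΔZgas = yεt from equation (4). This is only a sketch for the gas metallicity (the stellar metallicity also depends on the gas fraction), and ε = 2 Gyr^-1 is a hypothetical value chosen for the arithmetic, not a measurement quoted in the text:

$$\Delta Z_{\mathrm{gas}} = y\,\varepsilon\,\Delta t \;\;\Rightarrow\;\; \varepsilon_1 \Delta t_1 = \varepsilon_2 \Delta t_2, \qquad 0.5\ \mathrm{Gyr}^{-1} \times 4\ \mathrm{Gyr} = 2\ \mathrm{Gyr}^{-1} \times 1\ \mathrm{Gyr} = 2$$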
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Extended Data Figure 1 | Gas fraction and star-formation efficiency for 
star-forming galaxies. a, The observed total gas fraction for local galaxies
determined using a constant CO-to-H2 conversion factor X_CO (black dashed
line) and an H-band luminosity-dependent conversion factor (black solid line)
in Boselli et al. The predicted total gas fraction (molecular and atomic) for
star-forming galaxies as a function of stellar mass from the Peng and Maiolino
model at z ~ 0.05 (red solid line). b, Star-formation efficiency ε, defined as
ε = SFR/M_gas, that is, the inverse of the gas depletion timescale, as a function
of stellar mass.

Extended Data Figure 2 | Stellar metallicity difference for different redshift
bins. To investigate any aperture effects, the sample over the whole redshift
range 0.02 < z < 0.085 is further divided into two narrower redshift ranges of
0.02 < z < 0.05 and 0.05 < z < 0.085. It is clear that the derived stellar
metallicity difference changes little as a function of redshift, that is, as a function
of projected aperture. The error bars on each line indicate the 1σ uncertainty in
the metallicity difference.

Extended Data Figure 3 | Stellar metallicity difference for central and
satellite galaxies. The whole sample is further divided into central galaxies and
satellites. The orange line shows the central galaxies in the field, which
represents a clean sample of true central galaxies, as explained in the text. The
stellar metallicity enhancement of satellites is slightly larger than that of central
galaxies at M_star < 10¹⁰ M⊙ (suggesting that environment may play a part in the
strangulation mechanism at these low masses), while no detectable difference
between them is seen at higher stellar masses. The error bars indicate the 1σ
uncertainty in the metallicity difference.

Extended Data Figure 4 | Probability density function of star-forming and
passive galaxies. In each panel the blue line shows the probability density
function (PDF) of the stellar metallicity of star-forming galaxies for a given
stellar mass and the red line shows the corresponding PDF of passive galaxies.
The overlapping region of the two PDFs is shaded (light red). The fraction of
the shaded area over the total area given by each PDF (f_max) gives the
maximum fraction of galaxies for which rapid gas removal may be an allowed
alternative quenching mechanism.

Extended Data Figure 5 | Effect of a constant star-formation efficiency on
stellar metallicity evolution. As for Fig. 2b, but for the case of a constant
star-formation efficiency of ε = 0.5 Gyr⁻¹ (that is, a constant gas depletion timescale
of τ_dep = 2 Gyr) after strangulation. At M_star ≳ 10¹⁰ M⊙, the observed
mass-dependent metallicity enhancement is still consistent with Δt ≈ 4 or 5 Gyr,
while at lower stellar masses it requires a shorter Δt, as explained in the text.
Error bars on the black line indicate the 1σ uncertainty in the metallicity
difference.
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Electron pairing without superconductivity 


Guanglei Cheng, Michelle Tomczyk, Shicheng Lu, Joshua P. Veazey, Mengchen Huang, Patrick Irvin, Sangwoo Ryu,
Hyungwoo Lee, Chang-Beom Eom, C. Stephen Hellberg & Jeremy Levy


Strontium titanate (SrTiO3) is the first and best known superconducting
semiconductor. It exhibits an extremely low carrier density
threshold for superconductivity, and possesses a phase
diagram similar to that of high-temperature superconductors,
two factors that suggest an unconventional pairing mechanism.
Despite sustained interest for 50 years, direct experimental
insight into the nature of electron pairing in SrTiO3 has remained
elusive. Here we perform transport experiments with nanowire-based
single-electron transistors at the interface between SrTiO3
and a thin layer of lanthanum aluminate, LaAlO3. Electrostatic
gating reveals a series of two-electron conductance resonances
(paired electron states) that bifurcate above a critical pairing field
B_p of about 1-4 tesla, an order of magnitude larger than the
superconducting critical magnetic field. For magnetic fields below B_p,
these resonances are insensitive to the applied magnetic field; for
fields in excess of B_p, the resonances exhibit a linear Zeeman-like
energy splitting. Electron pairing is stable at temperatures as high 
as 900 millikelvin, well above the superconducting transition tem- 
perature (about 300 millikelvin). These experiments demonstrate 
the existence of a robust electronic phase in which electrons pair 
without forming a superconducting state. Key experimental signa- 
tures are captured by a model involving an attractive Hubbard 
interaction that describes real-space electron pairing as a precursor 
to superconductivity. 

SrTiO3 superconducts at temperatures below the superconducting
transition temperature T_c ≈ 300 mK and at electron densities as low
as 10¹⁷ cm⁻³. Electrons can be introduced via doping (for example, Nb
or La), oxygen vacancies or electrolytic gating. In 1969, it was
predicted that low-density superconductors (specifically, Zr-doped
SrTiO3) should exhibit unconventional real-space pairing in the
absence of superconductivity. At higher temperatures, electrons were
postulated to form tightly bound pairs; below the Bose-Einstein
condensation (BEC) transition temperature, superconductivity was
predicted to emerge.

The superconducting properties of SrTiO3 have been previously
investigated by electrical transport, tunnelling spectroscopy, and
the Nernst effect. New insights into the superconducting properties
of SrTiO3 come from heterointerfaces that enable transport in
reduced dimensions. The interface between TiO2-terminated SrTiO3
and a thin layer of LaAlO3 supports a two-dimensional electron liquid
that exhibits electric-field-tunable superconductivity. Recently, a
pseudogap phase similar to that seen in high-T_c superconductors
was observed at the LaAlO3/SrTiO3 interface using planar tunnelling
spectroscopy.

The superconducting single-electron transistor (SET), consisting of 
an electrically gated superconducting quantum dot (QD) coupled to 
superconducting leads by tunnelling barriers, presents a particularly 
powerful tool for probing fundamental properties of superconductors.
Transport signatures of metallic superconducting islands
include even-odd parity effects, Cooper pair tunnelling, and
parity-affected superconductivity. Generally, transport characteristics
depend on the relative magnitudes of the charging energy E_c,
superconducting gap energy Δ, and orbital level spacing δE in the QD.

Here we describe quantum transport measurements on LaAlO3/SrTiO3
SETs fabricated by conductive atomic force microscope lithography.
The devices are constructed from three basic elements:
superconducting nanowires, nanoscale potential barriers created by
conductive atomic force microscope erasure and electrical side gates
(see Methods). Figure 1a shows a schematic of a typical structure,
consisting of a nanowire (between leads 1 and 5) of width w ~ 5 nm,
three voltage probes (leads 2-4) and a side gate. Voltage leads are
located 2.5 μm apart, separating the main channel into
two segments. The upper segment (between leads 2 and 3) is ‘open’,
that is, without barriers, while the lower nanowire segment forms an
L_QD = 1 μm QD bounded by two barriers. A side gate tunes the chemical
potential of both the upper wire and the QD, and modulates the
tunnel coupling between the QD and the external leads.

The low-temperature (T = 50 mK) differential conductance (dI/dV)
versus side-gate voltage (V_sg) measurements for the nanowire QD and
open wire show contrasting transport characteristics (device A, Fig. 1b,
c). While the open wire (Fig. 1b) exhibits superconductivity at all V_sg
values shown, the QD (Fig. 1c) exhibits a sequence of diamond-shaped
insulating regions for V_sg < −10 mV. The conductance increases by
several orders of magnitude only when an available state in the QD is
aligned within k_BT (k_B, Boltzmann’s constant) of either the source or
the drain chemical potential; this condition defines the diamond-shaped
insulating regions in Fig. 1c. Within the diamonds, conductance
through the QD is highly suppressed (dI/dV < 10° * e*/h, where e is the 
electronic charge, and h is Planck’s constant). In the regime 
V_sg > 0 mV, where the barriers are highly transparent, supercurrent
recovers and flows resonantly through the QD.

Generally, the ‘addition energy’ (the difference of chemical potentials
μ_N and μ_{N+1}) required to change the charge state of a QD from N
to N + 1 electrons is the sum of both the classical charging energy E_c
and the orbital energy δE of the device: E_add(N) = E_c(N) + δE(N). For
QD systems involving semiconductors, carbon nanotubes or superconductors,
E_add is usually dominated by E_c, resulting in regularly spaced
Coulomb diamonds. In device A, E_add decreases (non-monotonically)
from 640 μeV (at V_sg = −47 mV) to 210 μeV (at V_sg = −13 mV). The
level spacing is non-uniform, signifying that orbital contributions
dominate the addition energy. Resonant supercurrent flowing through the
QD is only observed when the addition energy E_add falls below the
superconducting gap Δ ≈ 40 μeV (for example, at V_sg = −39 mV
and −19 mV), consistent with Anderson’s criterion for nanoscale
superconductivity (δE < Δ).

Figure 2a-e shows how the conductance diamonds evolve as a function
of an applied out-of-plane magnetic field B. At B = 0 T, two zero-bias
peaks (ZBPs) are visible, with some narrowing of the lineshape
taking place at B = 1 T. The diamond pattern remains relatively
unchanged at B = 2 T, though the size of the diamond is slightly
reduced. At B = 3 T, new diamonds emerge and separate as the
magnetic field is increased further to B = 4 T. A high-resolution scan
of the conductance versus gate voltage at zero bias (Fig. 2f) enables the
ZBP to be fitted and tracked versus magnetic field. A global shear of all
of the ZBP splittings above B_p is observed and offset in Fig. 2f (see
Extended Data Fig. 4). This shear, which appears in 60% of the total
devices and is possibly attributable to orbital effects, does not
influence the analysis of B_p. The ZBP at V_sg = −27 mV splits above a
critical pairing field B_p = 1.8 ± 0.1 T. The magnetic field at which this
pairing transition occurs is one order of magnitude larger than the
upper critical field for superconductivity (H_c2), μ₀H_c2 ≈ 0.2 T (here μ₀
is the vacuum permeability). The ZBPs at V_sg = −19 mV and
V_sg = −17 mV have successively smaller values for B_p and show
pronounced superconducting resonances for |B| < 0.2 T (indicated by
the red arrow in Fig. 2f). For |B| > B_p, the energy difference between
the split peaks increases linearly (Zeeman-like) with magnetic field:
E_Z = gμ_B(B − B_p). The Landé g-factor g = 1.2 ± 0.1 (Fig. 2g) is
calculated from the slope by taking into account the experimentally
determined coupling factor α = 0.10 ± 0.01 eV V⁻¹ (Methods). At much
larger magnetic fields the Zeeman-split ZBPs occasionally intersect
and ‘lock’ together (re-entrant pairing) before separating again (for
example, at V_sg = −25 mV). The energy associated with B_p,
E_p = gμ_B B_p, ranges between 100 μeV and 900 μeV for the four devices
shown here (Fig. 2h) and decreases non-monotonically with increasing
V_sg for each device.

Figure 1 | Device schematic and transport characteristics. a, Device
schematic. The nanowire width w = 5 nm, the nanowire QD length is 1 μm,
and barriers are 0.75 μm away from the sense leads 3 and 4. The length of the
open wire is 2.5 μm, equal to the nanowire QD length plus total distances from
barrier to sense leads. b, Device A dI/dV characteristics (colour coded) as a
function of four-terminal voltage V23 and side gate voltage V_sg in the open wire.
c, dI/dV characteristics of the nanowire QD measured simultaneously with data
shown in b.

Figure 2 | Out-of-plane magnetic field dependence of device A transport
characteristics at T = 50 mK. a-e, Top panels, dI/dV dependence on V34 and
V_sg at B = 0 to 4 T. New diamonds emerge at B = 3 T in d. Colour scale (top
right), 0-80 μS. Bottom panels, zero-bias line profiles in a-e. f, Top panel,
magnetic field dependence of ZBPs. All the ZBPs split above some critical
pairing fields, B_p. Colour scale, 0-40 μS. Bottom panel, line profile (black
markers) of ZBP at B = 3.8 T in the top panel, indicated by the horizontal black
arrow. Red line is the fit to extract peak locations (see Methods). Right panel,
line profile at V_sg = −19 mV indicated by the vertical black arrow. The sharp
peak at B = 0 is due to superconductivity (SC). g, Energy difference E_Z of two
Zeeman splitting branches of the ZBP at V_sg = −27 mV. B_p and g factor can be
extracted from the intercepts and slopes in the linear fits. h, E_p dependence on
rescaled V_sg for all available ZBP splittings in four devices A, B, C and D, where
$\tilde{V}_{\mathrm{sg}} = (V_{\mathrm{sg}} - V_{\mathrm{sg}}^{\min})/(V_{\mathrm{sg}}^{\max} - V_{\mathrm{sg}}^{\min})$,
and $V_{\mathrm{sg}}^{\max}$ and $V_{\mathrm{sg}}^{\min}$ are maximum and
minimum ZBP locations of each device. E_p roughly decreases with increasing
$\tilde{V}_{\mathrm{sg}}$.

Temperature-dependent transport measurements (Fig. 3) show B_p
to be nearly independent of temperature up to the highest value
measured (T = 900 mK). Of four devices, only device B, which shows the
lowest B_p, exhibits a threefold suppression at T = 900 mK. Figure 3a
shows a representative conductance map versus V_sg and B acquired at
T = 100 mK. The conductance at B = 6 T shows two well-resolved
split peaks at T = 100 mK. As the temperature is increased while the
magnetic field is held constant at B = 6 T, the side-gate splitting
between the two peaks ΔV_sg increases, as shown in Fig. 3c. Assuming
that the g-factor is constant over this temperature range, a larger
splitting at fixed B corresponds to a smaller B_p in E_Z = gμ_B(B − B_p),
so this result implies that dB_p/dT < 0, which is also consistent with the
fitting result summarized in Fig. 3d.

The quantum transport behaviour of LaAlO3/SrTiO3 nanowire
QDs contrasts sharply with conventional Coulomb blockade behaviour
in other semiconductor nanostructures. Experiments in Al-based
superconducting SETs report pair tunnelling only in the
superconducting state, and in the pair-tunnelling condition E_c < 2Δ when
the pairing energy dominates over the Coulomb charging energy.
The high permittivity of SrTiO3 leads to a significant reduction of
the charging energy for LaAlO3/SrTiO3 nanowire SETs, enabling them
to operate in the pair-tunnelling regime (see Methods for estimates
and analysis). The observed ZBP splitting indicates that electron pairing
is the preferred ground state that persists in magnetic fields far
larger than μ₀H_c2, above which superconductivity is suppressed.

The existence of electron pairs outside the superconducting regime
does not automatically imply that the electron pairing described here
contributes to the superconductivity itself. It would, however, be a
remarkable coincidence for the two phenomena to be superimposed
without any interrelationship. In fact, electron pairing and
superconductivity are demonstrably linked. The vertical linecut in Fig. 2f shows
a sharp superconducting enhancement of the ZBP at V_sg = −19 mV.
Like the other ZBPs, the paired electron state bifurcates at B_p ~ 2 T.
This marked enhancement of conductance in the superconducting
regime demonstrates that the electron pairs couple strongly to the
superconducting leads.

Figure 3 | Temperature dependence of B_p. a, Out-of-plane field dependence
of ZBPs from device B at T = 100 mK. Trace of red dots reveals the actual peak
locations extracted by fitting. b, Line profiles at B = 6 T at different
temperatures from 100 mK (blue) to 900 mK (red) with 100 mK spacing. Red
triangles mark actual peak positions and ΔV_sg is the difference between two
splitting peak positions. Curves are offset for clarity. c, ΔV_sg in b as a function of
temperature. A larger ΔV_sg at higher temperatures indicates lower B_p.
d, Temperature dependence of B_p for the most isolated ZBP splittings in four
devices B, C, D and E. B_p in device B, which is the lowest among the four devices,
decreases non-monotonically with increasing temperature. Error bars, s.e.m.
from the linear fitting errors of the positive and negative critical pairing fields.

Alternative explanations for the ZBP splittings have been considered.
The Kondo ridge in Coulomb diamonds can split above a critical
magnetic field. However, these splittings are generally observed at
non-zero biases; furthermore, other Kondo parity signatures are
not observed here. Charge traps that exist in parallel with tunnelling
barriers can contribute additional electrons to transport, resulting in
occasional resonance-doubling features (Methods). The main features
reported here are consistently reproduced in more than 50 devices (see
Extended Data Fig. 6 for more examples), and do not fit the statistical
profile of charge traps. In ultrasmall superconducting grains where
δE > Δ, quantum fluctuations may promote the even-odd parity
energy, leading to a possibly similar ZBP splitting. Such an effect,
however, is only expected for T < T_c.

Electron pairing without superconductivity can be described by a
phenomenological Fermi-Hubbard model (equation (1)) with an
attractive on-site potential. The QD is represented by a one-dimensional
chain of local pairing sites that can be occupied with zero, one, or
two electrons. The Hamiltonian is written as

$$H = -t\sum_{\langle i,j\rangle,\sigma}\left(c_{i\sigma}^{\dagger}c_{j\sigma} + c_{j\sigma}^{\dagger}c_{i\sigma}\right) + U\sum_{i} n_{i\uparrow}n_{i\downarrow} + g\mu_B B\sum_{i} S_i^z - \mu\sum_{i,\sigma} n_{i\sigma} \qquad (1)$$

where $c_{i\sigma}^{\dagger}$ and $c_{i\sigma}$ are creation and annihilation operators for electrons
on site i with spin σ = ↑, ↓; t quantifies the effective hopping between
adjacent pairing sites; $n_{i\sigma} = c_{i\sigma}^{\dagger}c_{i\sigma}$ is the number operator;
$S_i^z = (1/2)(n_{i\uparrow} - n_{i\downarrow})$ is the spin operator; U < 0 represents the
on-site attractive interaction strength; B is the applied magnetic field; and
μ is the chemical potential. For two electrons, equation (1) can be
solved exactly (Methods), yielding analytic expressions for the pairing
energy $\Delta_p(B) = \sqrt{16t^2 + U^2} - 4t - g\mu_B B$ and critical pairing field
$B_p = \left(-4t + \sqrt{16t^2 + U^2}\right)/(g\mu_B)$.
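
As a rough numerical illustration of the two closed-form expressions quoted above, the following Python sketch evaluates the pairing energy and the critical pairing field. The function names, the use of meV units and the default g = 1.2 (the value measured for device A) are assumptions made for this example only; t = 1 meV and U = −0.8 meV are the parameters quoted for the simulation in Fig. 4b.

```python
import math

MU_B_MEV_PER_T = 5.7883818e-2  # Bohr magneton in meV per tesla


def pairing_energy(t, U, B=0.0, g=1.2):
    """Pairing energy in meV: sqrt(16 t^2 + U^2) - 4 t - g mu_B B."""
    return math.sqrt(16.0 * t**2 + U**2) - 4.0 * t - g * MU_B_MEV_PER_T * B


def critical_pairing_field(t, U, g=1.2):
    """Field in tesla at which the pairing energy above falls to zero."""
    return (math.sqrt(16.0 * t**2 + U**2) - 4.0 * t) / (g * MU_B_MEV_PER_T)


if __name__ == "__main__":
    t, U = 1.0, -0.8  # meV, values quoted for the Fig. 4b simulation
    print(pairing_energy(t, U))          # roughly 0.079 meV at zero field
    print(critical_pairing_field(t, U))  # roughly 1.1 T for the assumed g
```

With these inputs the critical field comes out near 1 T, within the 1-4 T range reported for the devices, although the comparison is only indicative because g is taken from a single device.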

The zero-temperature stability diagram of the 16-site model (Fig. 4b) 
qualitatively captures many of the experimentally observed features: the 
existence of a critical pairing field B_p, a Zeeman-like splitting for
|B| > B_p, a decrease of B_p with increasing μ, and re-entrant pairing at
higher magnetic fields (Fig. 2f). The Hamiltonian (equation (1)) has no
disorder, resulting in the even level spacing seen in Fig. 4b. Adding 
some disorder to the Hamiltonian, either in the energy levels of each 
pairing site or in the kinetic hopping between pairing sites, makes the 
level spacings less regular, more closely resembling the spacings seen in 
the experimental ZBPs (Fig. 4a). Additionally, the superconducting 
regime of the attractive Hubbard model is not explored here, but is
covered extensively in the literature. Being phenomenological, equation (1)
does not specify a physical mechanism for the attractive on-site
interaction. One form of the attractive interaction can be ‘negative-U’ 
centres, for example, oxygen vacancies or vacancy complexes, similar to 
those which have been linked to enhanced superconducting transition 
temperatures in Tl-doped PbTe (refs 24, 25). Bipolaronic mechanisms 
have also been advocated in the literature, especially by Alexandrov
(Methods). Such specific mechanisms for strong pairing are not directly 
implied by the measurements reported here, although some predictions 
may be testable with suitable refinements of this experimental 
approach. 

Note that spin-orbit coupling is neglected in this analysis, even
though such effects are known to be important in two-dimensional
transport experiments. Spin-orbit coupling makes electron pairs
less sensitive to magnetic fields and leads to the violation of the Pauli
limit in SrTiO3 (Pauli limiting field μ₀H_P = 1.84T_c ≈ 0.5 T).
However, it is not clear how such coupling will increase the pairing
energy above T_c. At the LaAlO3/SrTiO3 interface, spin-orbit coupling
is known to be strongly dependent on the carrier density n_s. Direct
measurements of carrier density are not feasible in the geometry
employed here, although the density is believed to increase
monotonically with gate voltage.

Figure 4 | Comparison between experiment and attractive Hubbard model.
a, Dependence of ZBPs on V_sg and B of device A on a larger V_sg scale (same
colour scheme as Fig. 2f with a scale of 0-50 μS). b, Simulation result from the
attractive Hubbard model of 16 sites with t = 1 meV and U = −0.8 meV.

The existence of pre-formed electron pairs in this SrTiO3-based 
system, forming a superconducting condensate at lower temperatures 
and lower magnetic fields, follows the paradigm of BEC superconduct- 
ivity. In the BEC regime, pairing is local and precedes the formation of 
a superconducting state. The only well-established physical embodi- 
ments of fermionic BEC-like superfluidity have been in ultracold 
atomic gases, where the BEC-BCS (Bardeen-Cooper-Schrieffer)
crossover can be tuned via a Feshbach resonance. Although it is
not clear whether the strength of the electron pairing can be tuned 
(for example, via strain), a crossover to BCS-like superconductivity at 
higher electron density is expected. The ability to confine electrons at 
nanoscale dimensions, combined with an inherent affinity for strong 
pairing, suggests that our system constitutes an ideal ‘laboratory’ 
in which to explore strongly correlated electronic phases in a solid- 
state host. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Sample growth. LaAlO3/SrTiO3 samples are grown by pulsed laser deposition
(PLD). The SrTiO3 substrate is TiO2-terminated by etching in buffered HF for 60 s
either once or twice to improve substrate quality. Then the SrTiO3 substrate is
annealed at 1,000 °C for 2-12 h to achieve an atomically smooth surface. A thin
(3.4 unit cell) LaAlO3 film is subsequently grown on top of SrTiO3 by PLD at a
temperature of 550 °C and 1 × 10⁻³ mbar oxygen pressure, and gradually cooled
down to room temperature. Electrical contact to the LaAlO3/SrTiO3 interface is
made by Ar⁺ etching (25 nm) followed by sputter deposition of Ti/Au (5 nm/
20 nm). Additional details are given in ref. 15.

Device fabrication. The conductive atomic force microscope fabrication
technique provides great versatility in the creation of nanoscale devices. The key
device element is a nanoscale tunnel barrier that is integral to many devices,
including nanowire junctions, sketched field-effect transistors (‘SketchFETs’),
photodiodes, THz emitters and detectors, and sketched single-electron
transistors (‘SketchSETs’). In this work, conductive atomic force microscope
lithography is used to create 58 QD devices with varying dimensions (for example,
250 nm < L_QD < 1 μm) and barrier heights ΔR (see definition below) and on
multiple SrTiO3 substrates. Every device exhibits features that are qualitatively
similar to device A. In the following, a single barrier device, which has a similar
design to device A but contains only one barrier instead of two, is shown in
order to demonstrate the device fabrication technique (Extended Data Fig. 1a).
During the barrier fabrication, the four-terminal resistance difference
ΔR = R_QD − R_o is monitored in real time and serves as a figure of merit for
low-temperature barrier performance, where R_QD and R_o are the resistances of
the QD wire and the open wire. R_QD and R_o are obtained simultaneously by two
four-terminal measurements using two hardware-simulated lock-in amplifiers,
such that wire decay is eliminated from the measurement and ΔR is very precise.
Prior to barrier cutting, R_QD and R_o are nominally the same with ΔR < 5 kΩ
(within 1% difference). A sharp AFM tip (nominal radius of curvature ~8 nm)
moves across the wire at 200 nm s⁻¹ speed with small negative voltages (−0.1 V
to −0.5 V) multiple times, which causes ΔR to increase discretely, as shown
in Extended Data Fig. 1b. Low-temperature transport study (T = 75 mK)
shows the wire conductance can be pinched off by V_sg for a single barrier
device with ΔR ≈ 120 kΩ. The conductance oscillations (30 mV < V_sg <
50 mV) indicate quantum mechanical tunnelling through the barrier (Extended
Data Fig. 1c).

Barrier height. The barrier resistance ΔR is a good indicator of the tunnel barrier
width and strongly influences the low-temperature transport properties.
Extended Data Fig. 2 shows transport characteristics of three 500 nm QD devices
(devices F, G and H). The device designs are similar to that of device A, but with a
shorter distance of 500 nm between the two barriers. The only difference among
devices F, G and H is the single barrier resistance ΔR/2, with ΔR/2 = 20, 110 and
305 kΩ respectively. The resulting difference in transport is clear. Device F shows
the most conductance diamonds for the smallest barrier resistance
(superconductivity is suppressed in this device after applying a Vpg of −5.6 V), while device
H, with the largest barrier resistance, is virtually featureless. Conductance
diamonds and superconductivity-related phenomena in dI/dV of device G have
V_sg-dependence that is very similar to that of device A. Since the single barrier
tunnelling rate decays exponentially with the barrier width, the differences in
transport of devices F, G and H can be understood as a result of suppressed
quantum tunnelling rates under the assumption that the two barriers are slightly
asymmetric.

Additional data for device A. The barriers can fully suppress the conductance
of the wire in the low-V_sg regime, while allowing resonant tunnelling current
at selected low V_sg values, and fully superconducting transport at high V_sg
values. At low V_sg values, the line cut in Extended Data Fig. 3a shows ZBPs
separated by regions of fully suppressed conductance. At high V_sg values,
conductance oscillations due to quantum tunnelling are reduced as the barriers
become increasingly transparent and the device enters a fully
superconducting state.

Below B_p, the ZBPs are generally insensitive to magnetic fields. Above B_p the
centres of the ZBP splittings move nonlinearly with magnetic fields, as shown in
the red trace δV_sg(B) in Extended Data Fig. 4a. This movement, possibly arising
from the orbital effect, is only observed in a fraction of devices. A similar effect has
been reported in ref. 35. This is corrected (Extended Data Fig. 4b) by offsetting the
magnetically induced global shift to keep the centres of ZBP splittings relatively
constant, that is, V_sg(B) = V_sg0(B) − δV_sg(B). This shift procedure does not
influence the analysis of B_p, nor does it change the relative spacing between splittings. A
waterfall plot (Extended Data Fig. 5) of Fig. 2f is included to provide clear line cuts
of the ZBPs at all fields.


Analysis of B_p. Above B_p, single electron tunnelling occurs and standard
Coulomb blockade physics analysis applies. Closely spaced Coulomb blockade
ZBPs can be fitted to a multi-peak hyperbolic cosine expression

$$\frac{dI}{dV} = G_0 + \sum_i A_i \cosh^{-2}\left[B_i\left(V_{\mathrm{sg}} - V_{\mathrm{sg}}^{(i)}\right)\right] \qquad (2)$$

where G_0, A_i and B_i are constants. Equation (2) is used to examine two peaks
that have split from a single ZBP at low magnetic fields in order to extract the ith
peak location as a function of field (V_sg^(i)(B), indicated in the red trace in the
main panel of Fig. 2f in the main text). This fit was performed for multiple pairs
of split peaks in many devices. The energy difference of the split peaks,
E_Z(B) = α(V_sg^(2)(B) − V_sg^(1)(B)), where α is the coupling factor as described in the next
section, can be fitted to a straight line as in Fig. 2g. The slope of this fit gives the
Landé g-factor according to E_Z = gμ_B(B − B_p), where the Zeeman energy
difference is offset by the pairing energy E_p = gμ_B B_p. As mentioned in the last section,
‘straightening’ the magnetically induced global shift to keep the centres of ZBP
splittings relatively constant does not disturb this analysis. Both peak positions
V_sg^(1)(B) and V_sg^(2)(B) have the same offset at each field, so E_Z(B), from which g and B_p
are calculated, remains unaffected.
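
The straight-line step of this analysis can be sketched in a few lines of Python. This is a minimal illustration and not the authors' fitting code: the function name `fit_zeeman`, the use of `numpy.polyfit`, and the assumption of a field-independent coupling factor are choices made for this example, and the peak separations are taken to have been extracted beforehand from fits to equation (2).

```python
import numpy as np

MU_B_UEV_PER_T = 57.883818  # Bohr magneton in micro-eV per tesla


def fit_zeeman(B, delta_vsg_mV, alpha=0.10):
    """Fit E_Z = g mu_B (B - B_p) to split-peak data taken above B_p.

    B            : magnetic fields in tesla
    delta_vsg_mV : side-gate separation of the two split peaks, in mV
    alpha        : coupling factor in eV/V (assumed constant here)
    Returns (g, B_p in tesla, E_p in micro-eV).
    """
    # alpha [eV/V] times dV [mV] is an energy in meV; convert to micro-eV.
    E_Z = alpha * np.asarray(delta_vsg_mV, dtype=float) * 1e3
    slope, intercept = np.polyfit(np.asarray(B, dtype=float), E_Z, 1)
    g = slope / MU_B_UEV_PER_T   # slope of the line is g * mu_B
    B_p = -intercept / slope     # the line crosses zero at B = B_p
    return g, B_p, slope * B_p   # E_p = g * mu_B * B_p
```

Feeding the function noise-free synthetic separations generated with g = 1.2 and B_p = 1.8 T returns those values, together with E_p of about 125 μeV, which lies within the 100-900 μeV range quoted in the main text.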

Device transport parameters. As mentioned in the main text, bulk SrTiO3 is an
incipient ferroelectric at low temperatures with a divergently large and
gate-tunable dielectric constant ε_r ≈ 20,000. Consequently, estimation of the
gate-dot capacitance yields C_sg ≈ 100 fF, using a parallel wire model
C_sg = πε_rε₀L/cosh⁻¹(d/2r) with vacuum permittivity ε₀, QD length L = 1 μm,
QD-gate spacing d = 1 μm, and nanowire radius r = 5 nm (equal to the side gate
radius of 5 nm). The corresponding charging energy is vanishingly small in the ideal
case (that is, zero V_sg): E_c = e²/C_Σ < e²/C_sg ≈ 2 μeV, where C_Σ is the total
capacitance of the QD. The actual E_c could be larger since ε_r is expected to be reduced by
electric field and strain effects at the interface.
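
The order of magnitude of this estimate can be checked with a short Python sketch. The function names are hypothetical, and the inputs are simply the values stated in the paragraph above (relative permittivity of 20,000, L = d = 1 μm, r = 5 nm).

```python
import math

EPS0 = 8.8541878128e-12     # vacuum permittivity, F/m
E_CHARGE = 1.602176634e-19  # elementary charge, C


def parallel_wire_capacitance(eps_r, length, spacing, radius):
    """Parallel wire model: C = pi * eps_r * eps_0 * L / arccosh(d / 2r), in farads."""
    return math.pi * eps_r * EPS0 * length / math.acosh(spacing / (2.0 * radius))


def charging_energy_ueV(capacitance):
    """e^2 / C expressed in micro-eV (e^2/C in joules, divided by e, times 1e6)."""
    return E_CHARGE / capacitance * 1e6


if __name__ == "__main__":
    C_sg = parallel_wire_capacitance(eps_r=20000, length=1e-6, spacing=1e-6, radius=5e-9)
    print(C_sg * 1e15)                # gate-dot capacitance in fF, close to 100
    print(charging_energy_ueV(C_sg))  # upper bound on E_c in micro-eV, of order 1-2
```

Because the total capacitance is at least as large as the gate-dot capacitance, the second number is an upper bound on the charging energy under these idealized assumptions.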

For a nanowire with length L = 1 μm and width w = 5 nm, the number of
carriers can be estimated as N = n_sLw = 500 by using a typical two-dimensional
LaAlO3/SrTiO3 carrier density n_s ≈ 10¹³ cm⁻². The mean level spacing
between spin-degenerate levels can be estimated by using a ‘particle in a
one-dimensional box’ model and effective mass m* = 0.7m_e. This gives
δE ≈ 500 μeV, which is consistent with the values of E_add. As
one can see, the addition energy is dominated by orbital level spacing δE.

The ability of the side gate to tune the chemical potential of the device is
characterized by α = C_sg/C_Σ. The coupling factor can be calculated using the slopes
β and γ which define the diamonds in the dI/dV map, 1/α = 1/β + 1/γ. Coupling
factors vary from α ≈ 0.03 to 0.13, with a typical α ≈ 0.10. For all devices, the
coupling factor is observed to decrease at high V_sg values; this variation is reflected
in Extended Data Table 1.
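
A minimal Python sketch of this conversion is given below. It assumes, as an interpretation not spelled out in the text, that the two slopes are the magnitudes of the diamond-edge slopes (bias voltage per unit side-gate voltage); the function names and the example slope values are hypothetical.

```python
def coupling_factor(beta, gamma):
    """Coupling factor from 1/alpha = 1/beta + 1/gamma, using slope magnitudes."""
    return 1.0 / (1.0 / abs(beta) + 1.0 / abs(gamma))


def gate_shift_to_energy_ueV(delta_vsg_mV, alpha):
    """Convert a side-gate voltage shift in mV to an energy in micro-eV.

    alpha is in eV/V, so alpha * dV[mV] is in meV; multiply by 1e3 for micro-eV.
    """
    return alpha * delta_vsg_mV * 1e3


if __name__ == "__main__":
    alpha = coupling_factor(0.2, 0.2)            # hypothetical symmetric slopes
    print(alpha)                                 # close to 0.1
    print(gate_shift_to_energy_ueV(2.0, alpha))  # a 2 mV gate shift, close to 200 micro-eV
```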
Constant interaction model. The constant interaction model is widely used to 
analyse QD transport characteristics through two independent variables: 
Coulomb interactions and single-particle energy levels'®. Here, superconduct- 
ivity is combined with the constant interaction model and the analysis from ref. 
39 is extended by including non-zero orbital level spacing. In a QD with N 
electrons, the excess charge has two parts: the integer part n = N— No anda 
continuous part C.,V../e representing electrostatic charge induced by the gate, 
where Np is the charge at zero gate voltage. The system ground state energy E(N) 
can be written as 


E(N) = DoF + Ee(n—Vagze/ Ee)’ /2+pb, (3) 


where $\varepsilon_i$ are single-particle energy levels, p is a parity factor with p = 0 (1) for even
(odd) N and $\delta_p$ is the parity energy. The first term in equation (3) is the electrochemical
contribution determined by quantum confinement, the second term is the electrostatic
part induced by $V_{sg}$ and the third term is the extra energy
$\delta_p = E(N_{odd}) - (E(N_{odd}+1) + E(N_{odd}-1))/2$ which the odd electron has to
pay to enter the QD. The addition energy $E_{add}$, which is the difference (of chemical
potential μ) of a difference (of total energy E), is directly measured in the tunnelling
spectroscopy measurement. Namely, the chemical potential is


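$$\mu(N) = E(N) - E(N-1) = \varepsilon_N + E_c\,(n - 1/2) - e\alpha V_{sg} + \beta\,\delta_p \qquad (4)$$

where $\beta = -1$ (1) for even (odd) number of electrons (N). $E_{add}$ can subsequently
be written as

$$E_{add} = \mu(N+1) - \mu(N) = E_c + \delta E(N) + \gamma\,\delta_p \qquad (5)$$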


where $\gamma = 2$ (−2) for even (odd) number of electrons. Interestingly, as described in
the main text, $E_{add} = E_c - 2\delta_p$ can be negative in the odd case ($\delta E(N_{odd}) = 0$) if
$E_c < 2\delta_p$, suggesting this unpaired electron is not stable and wants to pair with
a partner. In a BCS superconductor, the parity energy is approximately the
gap energy ($\delta_p \approx \Delta$) in the limit of small level spacing $\delta E(N)$ compared to the
superconducting gap Δ ($\delta E \ll \Delta$). In the opposite extreme limit $\delta E \gg \Delta$, $\delta_p$ can be
enhanced such that $\delta_p = \delta E/(2\ln(\delta E/\Delta))$ due to quantum fluctuations. Either way
it is reasonable to assume $E_c < 2\delta_p$ based on the estimate of $E_c$, suggesting pair
tunnelling is the preferred transport mechanism in the devices explored here. Note
in the case of pairing without superconductivity, the parity energy $\delta_p$ should be
replaced by the pair binding energy $\Delta_b$ in equations (3)-(5). When the temperature
and magnetic field are increased, $\delta_p$ and $\Delta_b$ are suppressed to zero at the same
$B_p$ where the Zeeman splitting of the peaks occurs.
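
The sign change of the addition energy can be checked numerically. The sketch below is only an illustration of equations (3)-(5): the parameter values (in meV), the spin-degenerate level ladder and all function names are assumptions chosen for the example, not values from the devices. It builds $E_{add}$ directly from total energies and compares it with the closed form of equation (5).

```python
def ground_state_energy(N, levels, E_c, delta_p, gate_charge=0.0, N0=0):
    """Equation (3): level sum + charging term + parity term.

    gate_charge stands for the dimensionless quantity alpha*e*V_sg/E_c.
    """
    n = N - N0              # integer part of the excess charge
    parity = N % 2          # p = 0 for even N, 1 for odd N
    return sum(levels[:N]) + 0.5 * E_c * (n - gate_charge) ** 2 + parity * delta_p


def addition_energy(N, levels, E_c, delta_p):
    """E_add = mu(N+1) - mu(N), evaluated from total energies."""
    def E(k):
        return ground_state_energy(k, levels, E_c, delta_p)
    return E(N + 1) - 2 * E(N) + E(N - 1)


def addition_energy_eq5(N, levels, E_c, delta_p):
    """Closed form of equation (5)."""
    level_spacing = levels[N] - levels[N - 1]   # epsilon_{N+1} - epsilon_N
    gamma = 2 if N % 2 == 0 else -2
    return E_c + level_spacing + gamma * delta_p


# Hypothetical parameters (meV), chosen so that E_c < 2 * delta_p.
E_c, delta_p, spacing = 0.1, 0.2, 0.5
levels = [spacing * (k // 2) for k in range(12)]   # two electrons per orbital level

for N in (4, 5):
    print(N, addition_energy(N, levels, E_c, delta_p),
          addition_energy_eq5(N, levels, E_c, delta_p))
```

With these assumed numbers the odd-N addition energy comes out negative, which is the situation described above in which an unpaired electron is unstable against pairing.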

Attractive Hubbard model. In order to obtain more insights into the pairing 
picture, the analysis is further extended by using the attractive Hubbard model, 
which qualitatively captures many features seen in the devices. The Hamiltonian is 
written as in equation (1) in the main text. The only assumption is the attractive 
pairing potential U<0. The kinetic hopping t>0 describes the motion of the 
electron. For sufficiently attractive U <0, electrons will bind into pairs. This is the 
regime of the parity effect: in zero external field, the ground state as a function of 
gate voltage (represented by the chemical potential in equation (1)) will always 
contain an even number of electrons. The external magnetic field B favours polar- 
ized states, breaking the pairs. For magnetic fields greater than a critical field 
B> B,, ground states with odd electron numbers can be stabilized. The interaction 
between pairs causes the critical field to decrease monotonically with increasing 
filling (increasing chemical potential or gate voltage). 

As a simple example, the Hubbard model is solved on an infinite chain with 
zero, one and two electrons. The Bethe ansatz may be used to solve the model for 
arbitrary filling, but it is much more complicated than the approach presented 
here“. For zero and one electrons, the energies are simply given by: 


$$E_0 = 0$$

$$E_1 = -2t - \frac{1}{2} g\mu_B B - \mu$$

Only the low-field two-electron ground state, which is a spin singlet, is
considered for this analysis. The triplet will be the ground state at higher fields for two
electrons. The ground state has zero momentum, so the wavefunction depends
only on the separation between the electrons and must behave exponentially. Thus
the non-normalized wavefunction is

$$\psi(i,j) = e^{-\zeta|i-j|}$$

for electrons on sites i and j. For $i \neq j$, Schrödinger’s equation gives

$$E_2 = -2t\,(e^{\zeta} + e^{-\zeta}) - 2\mu \qquad (6)$$

while for i = j it gives

$$E_2 = U - 4t\,e^{-\zeta} - 2\mu \qquad (7)$$

Combining equations (6) and (7) yields

$$\zeta = \log\left(\frac{\sqrt{U^2 + 16t^2} - U}{4t}\right)$$

$$E_2 = -\sqrt{16t^2 + U^2} - 2\mu$$

Thus the binding energy for an electron pair is

$$\Delta_b = 2E_1 - E_2 = \sqrt{16t^2 + U^2} - 4t - g\mu_B B$$

and the ‘size’ of the pair is simply $1/\zeta$. The boundary between the phases with 0 and
1 electrons is given by

$$B_{0,1} = -\frac{4t + 2\mu}{g\mu_B}$$

The boundary between the phases with 1 and 2 electrons is given by

$$B_{1,2} = \frac{-4t + 2\sqrt{16t^2 + U^2} + 2\mu}{g\mu_B}$$

and the boundary between the phases with 0 and 2 electrons is independent of B:

$$\mu = -\frac{1}{2}\sqrt{16t^2 + U^2}$$

The plot of these three boundaries will have the shape of the letter ‘Y’. The three
boundaries meet at a critical point given by

$$\mu_p = -\frac{1}{2}\sqrt{16t^2 + U^2}$$

$$B_p = \frac{-4t + \sqrt{16t^2 + U^2}}{g\mu_B}$$
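
The two-electron solution can be verified numerically. The following is a minimal sketch, not the authors' code: it assumes the form $\psi(i,j) = e^{-\zeta|i-j|}$ with $\sinh\zeta = -U/(4t)$, measures the field as the Zeeman energy $g\mu_B B$, and borrows t = 1 meV and U = −0.8 meV from the 16-site calculation purely as illustrative inputs. It checks that equations (6) and (7) agree and that the 0/1 and 1/2 boundaries meet at the critical point.

```python
import math


def pair_decay_constant(t, U):
    """zeta obtained by combining equations (6) and (7)."""
    return math.log((math.sqrt(U * U + 16 * t * t) - U) / (4 * t))


def two_electron_energy(t, U, mu):
    """Closed-form singlet energy E_2."""
    return -math.sqrt(16 * t * t + U * U) - 2 * mu


def binding_energy(t, U, zeeman=0.0):
    """Delta_b = 2*E_1 - E_2, with zeeman = g*mu_B*B in energy units."""
    return math.sqrt(16 * t * t + U * U) - 4 * t - zeeman


def boundary_01(t, mu):
    """Zeeman energy g*mu_B*B on the boundary between 0 and 1 electrons."""
    return -(4 * t + 2 * mu)


def boundary_12(t, U, mu):
    """Zeeman energy g*mu_B*B on the boundary between 1 and 2 electrons."""
    return -4 * t + 2 * math.sqrt(16 * t * t + U * U) + 2 * mu


t, U, mu = 1.0, -0.8, 0.0          # illustrative values in meV
zeta = pair_decay_constant(t, U)
eq6 = -2 * t * (math.exp(zeta) + math.exp(-zeta)) - 2 * mu
eq7 = U - 4 * t * math.exp(-zeta) - 2 * mu

mu_p = -0.5 * math.sqrt(16 * t * t + U * U)   # critical chemical potential
zeeman_p = binding_energy(t, U)               # g*mu_B*B_p equals Delta_b at B = 0

print("zeta =", zeta, " pair size =", 1 / zeta)
print("E_2 from (6), (7), closed form:", eq6, eq7, two_electron_energy(t, U, mu))
print("g*mu_B*B_p =", zeeman_p)
```

In this picture the critical Zeeman energy is simply the zero-field pair binding energy, so a weak attraction compared with the hopping gives a small critical field.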
To expand the discussion, the lowest eigenvalues of the Hubbard Hamiltonian
(equation (1)) are solved on a 16-site chain using the iterative Lanczos algorithm,
which is particularly efficient for sparse matrices. The full Hilbert space has $4^{16}$
states, which are split into smaller subspaces using the total electron number,
$N_e = \sum_{i\sigma} n_{i\sigma}$, the total z-component of spin, $S^z = \sum_i S_i^z$, and the mirror symmetry
of the system. The total spin is an additional symmetry of the Hamiltonian which is
not exploited. The largest subspace contains 82,820,900 states.
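
The quoted subspace size can be reproduced by simple counting. The sketch below is one way to arrive at the number, under two assumptions that are not stated in the text: the largest block is the half-filled sector with 8 spin-up and 8 spin-down electrons, and every mirror-invariant occupation pattern contributes to the same mirror parity. The function names are illustrative.

```python
from math import comb


def sector_dimension(sites, n_up, n_down):
    """Number of occupation patterns with fixed N_e and S^z."""
    return comb(sites, n_up) * comb(sites, n_down)


def mirror_invariant_patterns(sites, n_up, n_down):
    """Patterns mapped onto themselves by the mirror of an even-length chain."""
    half = sites // 2

    def per_spin(n):
        # Electrons must occupy mirror-related sites in pairs.
        return comb(half, n // 2) if n % 2 == 0 else 0

    return per_spin(n_up) * per_spin(n_down)


def larger_mirror_block(sites, n_up, n_down):
    """Size of the larger of the two mirror-parity blocks of a sector."""
    total = sector_dimension(sites, n_up, n_down)
    fixed = mirror_invariant_patterns(sites, n_up, n_down)
    return (total + fixed) // 2


print(4 ** 16)                         # full Hilbert space of the 16-site chain
print(sector_dimension(16, 8, 8))      # half-filled, S^z = 0 sector
print(larger_mirror_block(16, 8, 8))   # after splitting by mirror parity
```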

Extended Data Fig. 7a shows the energy of the ground state for $N_e \le 16$ in zero
applied magnetic field. The lowest-energy state has total $S^z = 0$ for $N_e$ even and
$S^z = 1/2$ for $N_e$ odd. The ground state always has an even number of electrons,
which can be seen by shifting all the energies by a suitable function of μ, chosen
here as quadratic. This does not change their relative order but can make the
energy differences easier to visualize. In Extended Data Fig. 7b, the parity effect
is apparent: the ground state always contains an even number of electrons.

Increasing the magnetic field reduces the energy of the higher spin states relative
to the S = 0 ground states at B = 0. The ground state is polarized in the −z
direction, so the total spin S is identical to the z-component of spin $S^z$. The phase
diagram as a function of magnetic field and chemical potential μ is shown in
Extended Data Fig. 8. At low magnetic fields, the system consists of electron pairs:
$N_e$ is even, and the total spin S = 0. At slightly higher fields, it becomes favourable
to have a single unpaired electron, resulting in odd $N_e$ and S = 1/2. Increasing the
field further results in two unpaired electrons. Now $N_e$ is even again, but with total
spin S = 1. The pattern continues with increasing field: the number of unpaired
electrons increases monotonically.

Pairing mechanisms. The attractive Hubbard model does not specify a physical 
origin of the pairing mechanism. Here we discuss two possible forms of pairing 
sites: negative-U centres and bipolarons. The negative-U centre, which hosts a
bound electron pair, was first proposed in ref. 47 to account for the diamagnetism
in amorphous semiconductors. Its existence has been reported in various
materials, for example, hydrogenic or oxygen impurities in GaAs.
Meanwhile, negative-U centres have been proposed as a pairing mechanism in 
some unconventional superconductors such as Tl-doped PbTe (refs 24, 25). In 
SrTiO3, negative-U centres can possibly originate from oxygen vacancies (or
vacancy clusters) since the lowest threshold carrier density is only observed in
vacancy doped samples but not in samples with other n-type dopants (for example,
Nb). Another possible mechanism for local pairing is bipolaron formation.
Bipolarons are bound states of two polarons, which are self-localized electronic
states formed from lattice distortions, for example, via the Jahn-Teller effect.
When two polarons meet, they can share the same lattice distortion, lowering the 
total energy per electron and thus forming a bound state under certain conditions. 
The existence of polarons in SrTiO3 has been extensively reported (see, for
example, refs 52, 53). While there is no definitive experimental evidence of bipolaron
formation in SrTiO3, there are reports of bipolarons in other titanates.
Single-electron charge traps. A single-electron charge trap can be modelled with 
a series of capacitances that reflect the coupling between the trap and the QD, 
source and drain, as described in ref. 20. When the trap is in series with the QD, a 
large source-drain bias is needed to allow electrons to pass through the typically 
misaligned energy levels of the trap and QD. Namely, the conductance diamonds 
will have a large gap close to the zero-bias region (in contrast to our observations). 
When the trap is in parallel with the QD, the contribution to the conductance will 
be negligible since the coupling between the trap and either source or drain will be 
very weak due to the small trap size (compared to the 1 μm nanowire QD length).
A more realistic scenario is a combination of both the series and parallel coupling.
Namely, the trap is in parallel with one of the tunnel barriers and can occasionally
release an electron to the QD, which is commonly referred to as the background or offset
charge. The transport signatures of this type of trap are ‘sawtooth’-like diamonds,
and abrupt shifts of ZBPs in external magnetic fields. Such features are not
present in results reported here. Finally, perturbations of charge traps to the QD 
only happen occasionally, while the splittings are observed in every ZBP of every 
device in the work reported here. 

Sample size. No statistical methods were used to predetermine sample size. 
Extended Data Figure 1 | Nanoscale potential barrier engineering and low-temperature
transport characteristics. a, Single-barrier device schematic. It has the same structure
as device A except that only one barrier is integrated in the design. b, Resistance
change during barrier cutting (Methods); t is time. c, The differential conductance
dI/dV as a function of $V_{sg}$ at T = 75 mK. The wire can be pinched off by $V_{sg}$ at the
barrier site, as the wire conductance becomes negligibly small at lower $V_{sg}$ values.
Extended Data Figure 2 | Transport properties of three $L_{QD}$ = 500 nm SET
devices of different single barrier heights at T = 50 mK. Plots show colour-coded
dI/dV as a function of $V_{sg}$ and voltage across the QDs in devices F ($V_{34F}$),
G ($V_{34G}$) and H ($V_{34H}$). a, Device F (ΔR/2 = 20 kΩ) requires a back gate voltage
$V_{bg}$ applied on the substrate of −5.6 V to pinch off the device since $V_{sg}$
has limited tunability due to leakage at high absolute values. b, Device G
(ΔR/2 = 110 kΩ) shows similar properties to device A. c, Device H
(ΔR/2 = 305 kΩ) shows no conductance diamonds.
Extended Data Figure 3 | Transport characteristics due to barrier
confinement of device A. a, Zero-bias line cut of the dI/dV map in b; filled
black symbols show positions of line cuts displayed in c-e. c-e, Full suppression
of transport in device A at $V_{sg}$ = −25 mV (c), resonant tunnelling transport at
$V_{sg}$ = −19 mV (d), and fully superconducting transport at $V_{sg}$ = 0 mV (e).
Extended Data Figure 4 | Global shift correction of data from device A.
a, Original data for Figs 2f and 4a in the main text; B, applied magnetic field. The
global shift is illustrated by the red trace, $\delta V_{sg}(B)$. b, The same data shown in
Figs 2f and 4a in the main text that are corrected by $V_{sg}(B) = V_{sg0}(B) - \delta V_{sg}(B)$.
Extended Data Figure 6 | Transport characteristics of devices B, C, D and E,
which are all of the same geometry as device A. Device letter is shown at lower
right-hand corner of all plots. a, Device B dI/dV (colour coded) as a function of
$V_{sg}$ and $V_{34}$ at $V_{bg}$ = −0.7 V and T = 100 mK. A small gap (4Δ) close to zero-bias
in the diamonds is due to the absence of normal carriers in the
superconducting source/drain leads. b, Device B ZBP splitting in an out-of-plane
magnetic field B, which is the same data as Fig. 3a in the main text.
c, Device C dI/dV as a function of $V_{sg}$ and $V_{34}$ at $V_{bg}$ = −4.4 V and
T = 100 mK. d, Device C ZBP splitting in an out-of-plane magnetic field.
e, Device D dI/dV as a function of $V_{sg}$ and $V_{34}$ at $V_{bg}$ = −1.4 V and
T = 100 mK. f, Device D ZBP splitting in an out-of-plane magnetic field.
g, Device E dI/dV dependent on $V_{sg}$ and $V_{34}$ at $V_{bg}$ = −2.2 V and T = 100 mK.
h, Device E ZBP splitting in an out-of-plane magnetic field.
Extended Data Figure 7 | Parity effect. a, Energies $E_i$ of the Hubbard model
(equation (1) in main text) of a one-dimensional 16-site chain with open
boundary conditions, t = 1 meV, U = −0.8 meV, and B = 0 for fillings
$N_e \le 16$, plotted against μ (meV). The slope of each line is proportional to $-N_e$;
red (blue) lines have even (odd) $N_e$. For all chemical potentials μ, the ground
state has even $N_e$. b, Energies of the Hubbard model for the same parameters as a
shifted by a quadratic function of μ, $E_i^*(\mu) = E_i(\mu) + c\mu^2$, where c is arbitrary.
The lowest energy for each value of μ is easier to discern. The ground state always
has even $N_e$.
Extended Data Figure 8 | Phase diagram of the Hubbard model on a one-dimensional
16-site chain with t = 1 meV and U = −0.8 meV. The total
number of electrons $N_e$ and total spin S are labelled for some of the larger phases
as $N_e(S)$. The quantum numbers of the other phases can be deduced from their
neighbours.


©2015 Macmillan Publishers Limited. All rights reserved 




Extended Data Table 1 | Parameters of eight devices 


Device   L_w (μm)   L_QD (μm)   ΔR/2 (kΩ)   Coupling factor range (eV/V)
A        2.5        1           32          0.08-0.13
B        2.5        1           30          0.06-0.11
C        2.5        1           15          0.04-0.10
D        2.5        1           15          0.04-0.10
E        2.5        1           25          0.03-0.06
F        2          0.5         20          0.03-0.06
G        2          0.5         110         0.06-0.12
H        2          0.5         305         0.07-0.09

All devices have a schematic similar to that of device A but with different control (open) wire length L_w, distance between two barriers L_QD, single barrier resistance ΔR/2, and range of side gate coupling factor α for all
the diamonds.
Quantum coherent optical phase modulation in an
ultrafast transmission electron microscope


Armin Feist¹, Katharina E. Echternkamp¹, Jakob Schauss¹, Sergey V. Yalunin¹, Sascha Schäfer¹ & Claus Ropers¹


Coherent manipulation of quantum systems with light is expected 
to be a cornerstone of future information and communication 
technology, including quantum computation and cryptography’. 
The transfer of an optical phase onto a quantum wavefunction is a 
defining aspect of coherent interactions and forms the basis of 
quantum state preparation, synchronization and metrology. 
Light-phase-modulated electron states near atoms and molecules 
are essential for the techniques of attosecond science, including the 
generation of extreme-ultraviolet pulses and orbital tomo- 
graphy”*. In contrast, the quantum-coherent phase-modulation 
of energetic free-electron beams has not been demonstrated, 
although it promises direct access to ultrafast imaging and spec- 
troscopy with tailored electron pulses on the attosecond scale. Here 
we demonstrate the coherent quantum state manipulation of free- 
electron populations in an electron microscope beam. We employ 
the interaction of ultrashort electron pulses with optical near- 
fields*° to induce Rabi oscillations in the populations of electron 
momentum states, observed as a function of the optical driving 
field. Excellent agreement with the scaling of an equal-Rabi multilevel
quantum ladder is obtained, representing the observation of
a light-driven ‘quantum walk’ coherently reshaping electron density
in momentum space. We note that, after the interaction, the
optically generated superposition of momentum states evolves into 
a train of attosecond electron pulses. Our results reveal the poten- 
tial of quantum control for the precision structuring of electron 
densities, with possible applications ranging from ultrafast elec- 
tron spectroscopy and microscopy to accelerator science and free- 
electron lasers. 

The interaction of propagating light with confined electrons in 
atoms, molecules and solids is omnipresent, but the opposite case— 
the coupling of free electrons to localized optical fields—is not a nat- 
urally occurring phenomenon. Nonetheless, in both cases, the prin- 
ciple of confinement allows for optical transitions in otherwise 
mismatched electron and photon dispersion relations’*. Controlling 
free-electron propagation with low-frequency electromagnetic fields 
in resonator geometries is an integral aspect of accelerator science’’. At 
optical frequencies, however, particular challenges arise from the 
requirements of very controlled electron beams and tailored nano- 
structure near-fields. Increasing efforts are currently devoted to optic- 
ally drive electron trajectories on the nanoscale—for example, for 
applications in attosecond science and lightwave electronics'*"”. 

Some of the elementary phenomena involved in coupling free elec- 
trons to light were described more than half a century ago: in the 
Kapitza—Dirac effect”®”', electrons are elastically scattered off a stand- 
ing light wave, whereas the Smith-Purcell effect and its variants*???”* 
treat the inelastic interaction of free electrons with confined modes 
close to a grating. Recently, ultrafast electron microscopy schemes 
showed that the kinetic energy distribution of short electron pulses 
develops a series of photon sidebands after passage through an intense 
optical near-field*®. This approach, termed photon-induced near-field 
electron microscopy (PINEM)*, has been employed in the temporal 
characterization of ultrashort electron pulses (see Methods) and as a 


contrast mechanism in electron microscopy”. Beyond such 


advanced applications, the underlying interaction should allow for 
the preparation of coherent electronic superposition states and a 
phase-controlled harnessing of quantum coherence for the temporal 
shaping of electron bunches. 

Here we report the coherent phase-modulation of free-electron 
states in a nano-optical field. We experimentally induce multilevel 
Rabi oscillations in the form of a quantum walk in momentum space, 
obtaining excellent agreement with theoretical predictions by Garcia 
de Abajo et al.° and Park et al.° of this interaction. Moreover, we 
demonstrate theoretically that dispersive propagation transforms the 
optically modulated electron wavepacket into a train of attosecond 
peaks. In the experimental scenario displayed in Fig. 1, femtosecond 
electron pulses are generated by nonlinear photoemission from a 
nanoscale cathode***. After collimation and acceleration to an energy 
of 120 keV, the magnetic lens system of a transmission electron micro- 
scope focuses the electron pulses to a spot diameter of 15 nm in close 
vicinity to an optically excited conical gold tip. The localization of the 
nanostructure’s near-field mediates the optical interaction with the 
free electrons. This leads to the creation of multiple spectral sidebands, 
each corresponding to the absorption/emission of an integer number 
of photons (spectrum in Fig. 1e). Detailed information about the
interaction process is encoded in the number of populated sidebands 
and their individual amplitudes. For example, the maximum electron 
energy gain in the optical near field is a quantitative measure of the 
local transition amplitude, which can be imaged by raster scanning the 
electron focus (Fig. 1b). 

Microscopically, the electron-light interaction studied here constitutes
an optical phase-modulation of the electron wavefunction.
Expressed as a quantum mechanical multilevel system, electron energy
levels spaced by the photon energy ħω are coupled in the optical near-field
(level diagram, Fig. 1d). Previous experiments studying this interaction
found a partial reduction of the initial electronic state population
and a spectral broadening with distributions gradually decaying
towards large photon orders. Such observations evidence transitions
dominated by sequential multilevel excitation (processes of
type I in Fig. 1d). However, it is assumed that the coupling process
is coherent in nature, which implies that quantum features arising
from multipath interference (type II) should also be observable.

In order to identify such phenomena, we require an interaction of 
uniform strength with the entire electron ensemble in the pulse®. This 
scenario is achieved by using a spatially narrow probing beam and, in 
contrast to earlier works, an optical near-field excitation which has a 
uniform amplitude during the transit of the electron pulse envelope 
(see Methods). 

Under these conditions, we find experimentally that the population 
of photon sidebands exhibits a pronounced oscillatory behaviour cor- 
responding to multilevel Rabi oscillations, as demonstrated in Fig. 2 for 
electron spectra at a fixed position near the tip shaft. A colour-coded 
map (Fig. 2a) displays the evolution of the interaction-induced kinetic 
energy distribution with growing incident field strength. With increas- 
ing driving field, we observe a linear spreading in the range of 
Figure 1 | Schematic and principles of coherent inelastic electron scattering
by optical near-fields. a, Experimental scheme. Ultrashort electron pulses
generated by nanotip photoemission are accelerated and focused to a beam
that interacts with the optical near-field of a nanostructure, phase-modulating
the electron pulse and exchanging energy in integer multiples of the photon
energy. b, Raster-scanned image of the energy cutoff in the inelastic electron
scattering spectra, representing the local transition amplitude (see text).
c, Incident kinetic energy spectrum (full-width at half-maximum, 0.7 eV)
centred at $E_0$ = 120 keV. d, Energy level diagram of ladder states with spacing
ħω coupled to the initial state at $E_0$. Arrows indicate sequential multistate
population transfer (type I) and interfering quantum paths (type II) leading to
multilevel Rabi oscillations. e, Example of kinetic energy spectrum after the
near-field interaction, exhibiting a spectral comb with multiple sidebands
separated by the photon energy and modulated in occupation.


populated sidebands, together with strong oscillations in the central
part of the spectra. Specifically, the experimental spectra exhibit a
nearly complete extinction of the initial state occupation and its pronounced
recurrence at incident fields of 0.023 V nm⁻¹ (red line) and
0.040 V nm⁻¹ (green line), respectively (Fig. 2c). Quantitative analysis
of the field-dependent spectral evolution (Fig. 2b) shows the oscillations
of the initial state population (‘zero loss peak’) and those of
different electronic sidebands. These modulations directly evidence
multilevel Rabi oscillations and thereby a quantum coherent manipulation
of the respective level amplitudes, which, as a function of field
strength, traces out the evolution of an elementary quantum walk.
Recently, near-field-induced free-electron transitions, as observed
here, were theoretically treated by solving the time-dependent
Schrödinger equation (compare our results to figure 2 in ref. 5,
which depicts population oscillations simulated as a function of intensity).
Yielding equivalent theoretical results, we present a compact
description using raising and lowering operators acting on the
electronic state $|E_0\rangle$ of the system at an initial energy $E_0$. As demonstrated
in the Methods section, the action of the near-field can be
described by a scattering matrix $S = \exp(g^* a - g a^\dagger)$ with a dimensionless
near-field coupling constant g proportional to the field strength
and the transition matrix element. One may notice that S takes the




[Figure 2, panels a-c. Axes recoverable from the plot: a, incident field (V nm⁻¹) and Rabi phase; b, incident field (V nm⁻¹), with an inset plotted against field (V nm⁻¹); c, energy loss (eV).]

Figure 2 | Quantum coherent manipulation of electron energy 
distributions. a, Experimental electron energy distributions as a function of 
the incident optical field strength (left) and theoretical prediction in terms of 
Nth-order Bessel functions (right). b, Occupation probabilities (open circles) of 
the Nth-order spectral sidebands shown in a, adding contributions from ±|N|.
Solid lines, Nth-order Bessel functions. Inset, double-logarithmic plot of the
sideband populations near onset. Solid lines with slope 2N are shown for
comparison. c, Electron energy spectra at incident optical fields of 0, 0.023,
0.040, 0.053 and 0.068 V nm⁻¹ (bottom to top). Spectra shown in a and c are
normalized to their respective maxima. 
form of a displacement operator generating coherent states in the
harmonic oscillator model. In a variation of this scenario, we use $a^\dagger$
and $a$ as commuting raising and lowering operators, connecting free-electron
states separated by the photon energy, that is, $a^\dagger|E_0\rangle = |E_0 + \hbar\omega\rangle$
and $a|E_0\rangle = |E_0 - \hbar\omega\rangle$. The scattering matrix represents
the Hamiltonian evolution of the system as a unitary operation on
the initial electron wavefunction $|E_0\rangle$, which leaves the electron in a
superposition of ladder states $|E_0 \pm N\hbar\omega\rangle$, with N a natural number.

In the limit of small optical driving fields, the scattering matrix is
dominated by sequential multiphoton terms, for example, $\propto (g a^\dagger)^N$,
corresponding to type I transitions (Fig. 1d). The occurrence of interfering
quantum paths at increased optical field strength (type II paths)
becomes apparent by considering higher order terms in the Taylor
expansion of S, such as $a^\dagger$ and $a^\dagger a^\dagger a$, which both facilitate the transition
between the states $|E_0\rangle$ and $|E_0 + \hbar\omega\rangle$, but each with a different
phase factor in the final state.

Interestingly, because of the practically constant coupling matrix
elements between adjacent levels (the ‘equal Rabi’ case), the occupation
probability of the Nth photon sideband can be described by a
very simple analytical expression in the form of the Nth-order Bessel
function of the first kind, that is, $|\langle E_0 + N\hbar\omega|S|E_0\rangle|^2 = |J_N(2|g|)|^2$. In
a spatial representation, these transitions arise from a sinusoidal phase
modulation of the wavefunction traversing the optical near-field.
Accordingly, such sideband populations are also commonly encountered
in other physical systems using phase modulation, for example,
in acousto-optics.
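
The Bessel-function populations are easy to evaluate. The sketch below is an illustration only: it computes $J_N$ from its power series using the standard library, and it assumes the Rabi phase scaling $2|g| = F_{inc} \times 98\,\mathrm{V^{-1}\,nm}$ reported for this data set. The function names and the truncation limits (`terms`, `n_max`) are choices made for the example.

```python
import math


def bessel_j(order, x, terms=60):
    """Bessel function of the first kind J_order(x) from its power series.

    Adequate for the moderate arguments used here (x up to roughly 15).
    """
    n = abs(order)
    total = 0.0
    for m in range(terms):
        total += ((-1) ** m * (x / 2) ** (2 * m + n)
                  / (math.factorial(m) * math.factorial(m + n)))
    # J_{-n}(x) = (-1)^n J_n(x)
    return -total if (order < 0 and n % 2 == 1) else total


def sideband_populations(field_v_per_nm, n_max=30, slope=98.0):
    """Occupation |J_N(2|g|)|^2 of sidebands N = -n_max..n_max.

    slope is the assumed Rabi phase per unit incident field (nm per V).
    """
    rabi_phase = slope * field_v_per_nm          # 2|g|
    return {N: bessel_j(N, rabi_phase) ** 2 for N in range(-n_max, n_max + 1)}


for field in (0.023, 0.040):
    pops = sideband_populations(field)
    print(field, "zero-loss population:", pops[0], " total:", sum(pops.values()))
```

Under this scaling the zero-loss population nearly vanishes close to the first zero of $J_0$ and partly recovers near its next extremum, in line with the extinction and recurrence seen around 0.023 and 0.040 V nm⁻¹; the populations sum to one because S is unitary.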

Comparing the experimental field-dependent electron populations
with the analytical result (Fig. 2b), an excellent agreement is found
both in the location and amplitudes of the respective occupation
minima/maxima. The entire data set is described with a single Rabi
phase $2|g| = F_{inc} \times 98\,\mathrm{V^{-1}\,nm}$ linearly increasing with the incident
optical field strength $F_{inc}$, yielding a quantitative measure of the transition
matrix element. As detailed in the Methods section, incomplete
modulation of the Rabi oscillations at higher fields is caused by the
finite spatial and temporal electron pulse widths within the optical
near-field. Besides the predicted population oscillations, the characteristic
low-field multiphoton limit of the electron-light interaction is
also experimentally regained (slopes of 2N in the field in a double-logarithmic
plot, see inset of Fig. 2b). Larger incident fields prominently
transfer the electron distribution to the outer spectral lobes,
creating a well-defined cutoff around $|E - E_0| = 2|g|\hbar\omega$, equal to the
maximum classical energy transfer. Thus, as in other instances of
electrons driven by intense optical near-fields, the interaction
energy is governed directly by the field amplitude instead of the ponderomotive
energy.
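
As a rough worked example of the cutoff, the sketch below evaluates $2|g|\hbar\omega$ for a given incident field. It rests on two assumptions taken from elsewhere in the paper rather than from this paragraph: the Rabi phase scaling of 98 V⁻¹ nm, and a photon energy derived from the 2.55 fs optical period quoted in the Fig. 3 caption. The function names are illustrative.

```python
PLANCK_EV_S = 4.135667696e-15   # Planck constant in eV s


def photon_energy_ev(period_fs):
    """Photon energy h/T for an optical period T given in femtoseconds."""
    return PLANCK_EV_S / (period_fs * 1e-15)


def cutoff_energy_ev(field_v_per_nm, period_fs=2.55, slope=98.0):
    """Spectral cutoff |E - E_0| = 2|g| * hbar*omega, with 2|g| = slope * field."""
    return slope * field_v_per_nm * photon_energy_ev(period_fs)


for field in (0.023, 0.040, 0.053, 0.068):
    print(field, "V/nm ->", round(cutoff_energy_ev(field), 2), "eV")
```

The cutoff grows linearly with the field amplitude, which is the sense in which the interaction energy is set by the field rather than by the ponderomotive energy.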

This periodic phase (and correspondingly momentum) modulation 
of the electron wavefunction has important consequences for its sub- 
sequent evolution in free propagation. Generally, momentum modu- 
lation of classical states in particle accelerators is used for bunch 
compression’, and an optical variant of this principle was recently 
proposed using ponderomotive forces acting on classical point part- 
icles**. However, the present conditions with electronic coherence times 
exceeding the optical period necessitate a quantum mechanical descrip- 
tion of bunch reshaping. Figure 3a displays a few cycles of the simulated 
electron density in a periodically phase-modulated wavepacket as a 
function of the propagation distance and the arrival time at this distance 
relative to the centre of the pulse. Specifically, the phased superposition 
of momentum states reshapes into a high-contrast train of attosecond 
pulses at a well-defined distance downstream from the interaction 
region. For typical coupling constants achieved in the experiments, 
we obtain a temporal focusing into a train of pulses only about 80 as 
long, at a distance of 1.8mm behind the sample. Further dispersion 
spreads the distribution corresponding to its momentum content, with 
the possibility of revivals. Note that a spatial optical equivalent of this 
generation of attosecond spikes is given by Fresnel diffraction at a 
sinusoidal phase grating into a near-field fringe pattern, and also that
an early theoretical scheme for subfemtosecond optical pulse generation
relied on frequency modulation and subsequent reshaping.

The physical origin of this electron pulse compression can be illu- 
strated using a phase space representation of the quantum state in the 
form of a Wigner function. This function is a quantum mechanical 
analogue of a phase space density, which, however, can also take nega- 
tive values for non-classical states**. Figure 3b displays the Wigner 
function of one period of a propagated state at the temporal focus 
and for a typical momentum distribution (projection in Fig. 3c). In 
this plane, free propagation of the initially sinusoidal momentum 
modulation has sheared the phase space distribution to a situation 
where a highly localized projection onto the position axis, that is, 
arrival time, is formed (Fig. 3d). In fact, the generation of this attose- 
cond electron pulse train is very robust with respect to variations of the 
specific temporal and energetic structure of the initial electron pulse 
(see Methods). The practically usable focal distances (a few milli- 
metres) render this scheme directly applicable in electron microscopy 
or spectroscopy studies with attosecond precision, a domain at present 
Figure 3 | Formation of an attosecond electron pulse train. a, Development
of a periodically modulated electron pulse structure (normalized electron 
density is colour coded) as a function of the propagation distance after the near- 
field interaction (numerical simulation for |g| = 5.7). Free propagation causes 
a temporal focusing into a train of attosecond spikes (red) with a period of 

T = 2.55 fs (optical period). b, Phase space (Wigner) representation of one 
period of the light-modulated electron quantum state at the temporal focus 
position (propagation distance of 1.8mm in a). Note that time and space 
variables for the swift electron pulse can be used equivalently via x = vt (v is 
mean electron velocity). c, Momentum projection of Wigner function 
exhibiting spectral modulations as observed in the experiments, displayed in 
units of transferred momentum quanta (average momentum subtracted). 

d, Central part of spatial projection, expressed in terms of electron arrival time 
in laboratory frame. A peak with a duration of only 82 as (FWHM) is produced. 


©2015 Macmillan Publishers Limited. All rights reserved 


only accessible by attosecond light pulses’. Specifically, the temporal 
electron comb will enable the phase-resolved investigation of coherent 
sample excitations, thus tracing structural or electronic changes car- 
rying optical phase information. 

In conclusion, we have demonstrated the quantum coherent manip- 
ulation of free-electron wavefunctions by their interaction with nano- 
confined light fields, observing near-perfect correspondence to the 
behaviour of a multilevel model Hamiltonian. Thinking beyond a 
single-variable state control, near-field interactions are expected to 
cause entanglement of longitudinal and transverse momentum com- 
ponents, and moreover, Coulomb interactions in a beam crossover 
will result in correlations between multiple electrons. Both features 
may be crucial for employing free electrons in quantum information 
technology’. Perhaps surprisingly, the generation of an attosecond 
electron train is a direct and natural consequence of this optical 
phase-modulation. We anticipate various applications of this concept 
in imaging and spectroscopy—for example, in the phase-resolved 
detection of coherent, resonantly driven polarizations in solid state 
materials—thus opening up the study of attosecond phenomena in 
electron microscopy. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper.

METHODS


Ultrafast TEM and experimental setup. We have recently constructed an ultra- 
fast TEM (UTEM) to enable a variety of laser-pump/electron-probe imaging 
schemes with high spatial resolution. The microscope is based on a commercial 
Schottky field emission TEM (JEOL JEM-2100F), which we modified to allow for 
both optical sample excitation and laser-driven electron pulse generation in the 
gun, as shown in Extended Data Fig. 1. In contrast to previous implementations of 
time-resolved TEM, our instrument features a nanotip photocathode as the source 
of ultrashort electron pulses. Compared to planar emitters, such needle cathodes 
provide reduced electron beam emittance, which is particularly useful for nanos- 
cale probing and spectroscopy. In the present experiments, we employ electron 
pulses with a repetition rate of 250 kHz, which are induced via two-photon photo- 
emission by irradiating the apex of a tungsten field emission cathode (apex radius 
of curvature of about 120 nm) with ultrashort laser pulses (400 nm central wave-
length, 50 fs pulse duration, 4.1 nJ pulse energy, 25 GW cm⁻² incident peak
intensity). The emitter tip is operated at room temperature and with an electro-
static extraction field of 0.1 V nm⁻¹.

At an electron energy of 120 keV, the beam is focused to a spot diameter of about
15 nm, with a typical electron pulse duration of 700–900 fs (full-width at half-
maximum, FWHM) at the position of the sample (characterized by electron-laser 
cross-correlation, see below). This pulse duration is at present governed by velocity 
dispersion of electrons with different initial kinetic energies after photoemission, 
which, however, is not a limitation for the measurements described here. While 
pulses may contain many electrons, all experiments reported here were acquired 
with less than one electron per pulse at the sample position (single-electron mode), 
thus avoiding potential space charge influences. 

The laser pulse exciting the optical near-field (central wavelength of 800 nm, 
pulse energy of up to 60 nJ) is stretched by dispersion in glass to a pulse duration 
of 3.4 ps and focused to a spot size of ~50 µm, yielding peak intensities of
1.33 GW cm⁻² at the largest applied fluence. The excitation with a laser pulse of
a duration much longer than that of the electron pulse allows for the observation 
of coherent population oscillations (see section ‘Numerical calculations’ below). 

After interaction with the optical near-field, the electron pulse is imaged (mag- 
nification of 25,000) into an electron energy-loss spectrometer (EELS) to analyse 
its kinetic energy distribution (spectrometer entrance aperture of 3 mm, energy 
dispersion of 0.05 eV per detector channel). 

Modulated electron spectra are observed at any electron probe position
within the optical near-field. However, the quality of the spectral modulation
and full extinction of individual orders crucially depends on the transverse 
homogeneity of the near-field on the scale of the electron beam diameter (see 
section ‘Numerical calculations’ below). For a nanoscopic tip, the optical near- 
fields are only slowly varying along its shaft. Therefore, we chose a position of 
the electron focus several micrometres away from the tip apex (see Extended 
Data Fig. 2).

Temporal characterization of electron pulses. In UTEM, the duration of the
electron pulses is governed mainly by Coulomb repulsion and dispersive broad- 
ening within the electron gun and subsequent electron optics. The resulting elec- 
tron pulse structure in the sample plane can be quantitatively characterized 
employing inelastic near-field electron scattering****’, as discussed in the main 
text. However, to this end, and in contrast to the experimental conditions 
described in the previous section, laser pulses of a duration (here 50 fs) much 
shorter than the electron pulse duration should be used in an electron/near-field 
cross-correlation”’. 

Extended Data Fig. 3a displays electron energy spectra as a function of the 
temporal delay between the optical excitation and the electron pulse arrival, sub- 
tracting the electron spectrum in the absence of near-field excitation. Therefore, 
the central blue feature corresponds to the reduction of the zero-loss peak due to 
scattering of electrons into multiple photon sidebands (red stripes). The width of 
these features (Extended Data Fig. 3b) and their tilt in the energy-time diagram 
(inset) provide measures of the electron pulse duration and chirp, respectively. 
Here, we obtain a cross-correlation with a FWHM of 800 fs (standard deviation of 
340 fs) and a chirp of ~760 fs eV⁻¹ for an initial energy spread of 1.3 eV. In the
experiments presented in the main text, electron pulses with a narrower energy
spread of 0.7 eV were achieved. For the much shorter near-field excitation
employed here, the scattering signal is linear in the momentary density of electrons 
within the pulse. Therefore, we can extract an energy-time representation of the 
electron pulse by superimposing all tilted sidebands (Extended Data Fig. 3c). 
Finally, we emphasize that temporally stretching the near-field excitation to more 
than 3 ps in the experiments ensures that it lasts much longer than the electron 
pulse duration and, therefore, provides for a nearly homogeneous scattering 
amplitude throughout the electron pulse. 
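The two widths quoted for the cross-correlation (FWHM of 800 fs, standard deviation of 340 fs) are consistent with each other if the trace is Gaussian. The short check below makes that conversion explicit; the Gaussian shape is an assumption of this sketch, and the function name is illustrative.

```python
import math

def fwhm_to_sigma(fwhm):
    """Convert the FWHM of a Gaussian profile to its standard deviation."""
    return fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))

# Cross-correlation FWHM quoted in the text, in femtoseconds.
sigma_fs = fwhm_to_sigma(800.0)
print(f"standard deviation: {sigma_fs:.0f} fs")
```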

Data analysis and drift correction. In order to obtain high quality electron 
spectra at a fixed sample location for varying driving field strengths, it has to be
ensured that the laser-induced sample displacement, for example, by thermal
expansion, is compensated for. This was achieved by first characterizing the
fluence-dependent sample shift in imaging mode (observed up to 150 nm) and
by automated electron beam repositioning between experimental runs. In addi-
tion, slow residual drifts (up to 20 nm, see Extended Data Fig. 2a) were cor-
rected for by continuous line scans perpendicular to the gold surface and using
the strength of the bulk plasmon band to identify the beam-surface distance. In 
the recorded spectra, energy losses due to bulk plasmon excitation generate a 
weak and spectrally broad band at energies above 15 eV (ref. 36), which is only 
present when the electron beam is placed in close proximity to the tip so that 
the outer tail of the electron focal spot grazes the tip surface. The plasmon 
contribution is well-separated from the main spectral features and can be easily 
subtracted from the spectra. Specifically, we identify the sideband populations
by adopting a global fit function containing pseudo-Voigt profiles $V_{p}(E)$ for the
zero-loss peak and all photon sidebands (using symmetric amplitudes in $\pm|N|$).
The plasmon peak at a loss energy $E_{\mathrm{pl}}$ was described by an asymmetric
Gaussian $G_{\mathrm{pl}}(E)$:

$$P(E) = G_{\mathrm{pl}}(E - E_{\mathrm{pl}}) + \sum_{N} a_{|N|}\,V_{p}(E + N\hbar\omega) \tag{1}$$


Extended Data Fig. 4 shows a typical electron spectrum together with the fitted 
function. Note that an energetic shift of the sideband comb relative to the zero- 
loss peak due to electron chirp is absent in the case of a long excitation pulse 
relative to the electron pulse duration and does not have to be included in the 
fit. An evaluation of the strength of the plasmon band as a function of beam- 
surface distance allows for a positioning accuracy of ±5 nm.
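The structure of the fit function in equation (1) can be sketched as follows. This is only an illustration of the model shape: the particular pseudo-Voigt and asymmetric-Gaussian parametrizations, the parameter names, and the photon energy of 1.55 eV (taken for 800 nm light) are assumptions of this sketch, and the least-squares fitting step itself is not shown.

```python
import math

HBAR_OMEGA_EV = 1.55  # assumed photon energy for 800 nm light, in eV

def pseudo_voigt(E, fwhm, eta):
    """Unit-height pseudo-Voigt: eta * Lorentzian + (1 - eta) * Gaussian."""
    lorentz = 1.0 / (1.0 + (2.0 * E / fwhm) ** 2)
    gauss = math.exp(-4.0 * math.log(2.0) * (E / fwhm) ** 2)
    return eta * lorentz + (1.0 - eta) * gauss

def asymmetric_gaussian(E, amp, sigma_low, sigma_high):
    """Gaussian centred at E = 0 with different widths on either side."""
    sigma = sigma_low if E < 0.0 else sigma_high
    return amp * math.exp(-0.5 * (E / sigma) ** 2)

def spectrum_model(E, amplitudes, fwhm, eta, plasmon):
    """Plasmon band plus a sideband comb with symmetric amplitudes a_|N|.

    amplitudes[k] is the amplitude shared by the sidebands N = +k and N = -k;
    plasmon is the tuple (amp, E_pl, sigma_low, sigma_high).
    """
    amp_pl, e_pl, s_low, s_high = plasmon
    total = asymmetric_gaussian(E - e_pl, amp_pl, s_low, s_high)
    n_max = len(amplitudes) - 1
    for N in range(-n_max, n_max + 1):
        total += amplitudes[abs(N)] * pseudo_voigt(E + N * HBAR_OMEGA_EV, fwhm, eta)
    return total

# Illustrative parameters only (not fitted values from the experiment).
example = spectrum_model(0.0, [1.0, 0.6, 0.3], fwhm=0.7, eta=0.3,
                         plasmon=(0.05, 20.0, 4.0, 8.0))
print(example)
```

In an actual fit, the amplitudes, the common profile width and the plasmon parameters would be the free parameters adjusted to each recorded spectrum.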

Materials. The nanostructure employed in this work was prepared from a thin 
gold wire (diameter 250 µm) which was subjected to thermal annealing in vacuum
(800 °C, 12 h) to increase crystallinity and reduce surface roughness. A sharp tip
(100 nm apex radius) was formed by electrochemical etching in aqueous hydro-
chloric acid (37%). Afterwards, the conical part of the tip (length ~50 µm) was
cut by focused ion beam milling, transferred to a silicon frame and attached by ion- 
beam deposited platinum. 

Quantum description using ladder operators. The interaction of electrons tra- 
versing an optical near-field has been theoretically treated several times in the 
past, usually by either direct integration of the time-dependent Schrödinger
equation® or using a Green’s function formalism*. The relation of this stimu- 
lated process to the spontaneous mechanisms observed by electron energy-loss 
spectroscopy and cathodoluminescence is discussed in ref. 39. Furthermore, it 
was also shown that a non-relativistic approach is sufficient as long as the rela- 
tivistically correct electron dispersion (velocity as a function of energy) is used in 
the final result*’. Here, we present an alternative derivation of inelastic near-field 
scattering probabilities using ladder operators, which allows for a succinct 
description. 

The raising and lowering operators. Electrons in a time-harmonic electromagnetic
field can experience energy loss or gain in multiples of the photon energy $\hbar\omega$,
where $\omega$ is the frequency of the field. This allows us to treat the problem as a
multilevel quantum system. Within the Schrödinger picture, the free-electron
Hamiltonian $H_0$ does not depend on time, while the wavefunction $|\psi(t)\rangle$ of the
electron is time-dependent. Thus, the total Hamiltonian for the interaction with
the electromagnetic field in the velocity gauge is


$$H = H_0 + \frac{e}{m}\,pA \tag{2}$$


where $A$ is the space- and time-dependent vector potential, and $p$, $e$ and $m$ are
the electron momentum, charge and mass, respectively. For a time-harmonic
vector potential, a natural basis set is composed of plane wave states $|N\rangle$ offset
from the initial energy $E_0$ by an integer multiple $N$ of the photon energy, where
each state $|N\rangle$ is an eigenstate of the unperturbed Hamiltonian $H_0$:
$H_0|N\rangle = (E_0 + N\hbar\omega)|N\rangle$. Thus, $|0\rangle$ is the initial state, and $|N\rangle$ corresponds to
the state with $|N|$ absorbed/emitted quanta. The time-harmonic interaction
Hamiltonian causes transitions between these basis states. In particular, the
matrix elements between adjacent states of the form $\langle N+1|\frac{e}{m}pA|N\rangle$ will lead
to considerable transition probabilities. In contrast, the coupling between states
separated by more than one photon energy causes transition amplitudes rapidly
oscillating in time (at multiples of the frequency $\omega$), which prevents direct multi-
photon transitions. (Note that multiphoton transitions will become possible by
multiple actions of the field.)

In order to compute the coupling between neighbouring states, let us consider
for simplicity a one-dimensional model with the time-harmonic vector potential
$A = F(z)\sin(\omega t)/\omega$, where $F(z)$ is the spatial distribution of the electric field ampli-
tude. To obtain the matrix elements $\langle N+1|\frac{e}{m}pA|N\rangle$, we use $|N\rangle$ in a plane wave
form $L^{-1/2}\exp(ik_N z)$ in a finite spatial interval of length $L$, where $\hbar k_N$ is the electron
momentum. In this representation, the matrix elements can be readily computed,
for instance

$$\left\langle N+1\left|\frac{e}{m}\,pA\right|N\right\rangle = \frac{2\hbar v_N g}{L}\sin(\omega t), \qquad g = \frac{e}{2\hbar\omega}\int_{-L/2}^{L/2} F(z)\exp(-i\Delta k\,z)\,dz \tag{3}$$


where $v_N$ is the electron velocity in the state $|N\rangle$, and $\Delta k \approx \omega/v_N$ is the electron
momentum change (divided by $\hbar$). The dimensionless coupling constant $g$
expressed in terms of a Fourier amplitude in equation (3) was introduced in
a similar form as used in ref. 6. Physically, $g$ describes the momentum com-
ponent in the near-field distribution which allows for total energy and
momentum conservation in the transition, that is, it represents the momentum
change of an electron undergoing an energy transition of $\hbar\omega$. Regarding the
integration limits in equation (3), at present, it is only important that the
interval length $L$ is larger than the extension of the near-field, as $L$ will cancel
out in the final result. It should be noted that for an initial energy much higher
than the total energy of the absorbed or emitted photons, $E_0 \gg |N|\hbar\omega$, the
coupling matrix elements in equation (3) become practically independent of $N$,
as does the velocity $v \approx v_N$. The presence of a single and universal coupling
constant renders the present quantum system a nearly perfect example of an
equal Rabi multilevel system. The transitions in this system can be concisely
described by introducing the raising and lowering operators $a^{\dagger}$ and $a$, respect-
ively, as


$$|N+1\rangle = a^{\dagger}|N\rangle, \qquad |N-1\rangle = a|N\rangle \tag{4}$$


Note that, in contrast to the commonly employed ladder operators of a harmonic
oscillator (which has a coupling constant scaling with $\sqrt{N}$), it follows from equa-
tion (4) that $a$ and $a^{\dagger}$ commute: $aa^{\dagger} = a^{\dagger}a$. The essential parts of the interaction
Hamiltonian then take a bi-diagonal form, which can be represented in the raising
and lowering operators


$$\frac{e}{m}\,pA = \frac{2\hbar v}{L}\left(g^{*}a + g\,a^{\dagger}\right)\sin(\omega t) + O\!\left(a^{n}, a^{\dagger n};\ n \ge 2\right) \tag{5}$$

The higher order contributions can be neglected in the following, as they lead to
negligible transition probabilities (see below), and terms on the main diagonal are
absent because the spatial integral over the near-field distribution $F(z)$ (the case of
$\Delta k = 0$) vanishes.

The S-matrix. To obtain transition probabilities for electrons after passage
through the near-field, it is convenient to switch to the interaction picture.
Here, the lowering and raising operators become time-dependent: $a(t)$, $a^{\dagger}(t)$.
In our case, they can be easily expressed in terms of $a$ and $a^{\dagger}$ by the transformation


$$a(t) = \exp(-i\omega t)\,a, \qquad a^{\dagger}(t) = \exp(i\omega t)\,a^{\dagger} \tag{6}$$


and the interaction Hamiltonian turns into 


$$H_{\mathrm{int}}(t) = \frac{2\hbar v}{L}\sin(\omega t)\left[\exp(-i\omega t)\,g^{*}a + \exp(i\omega t)\,g\,a^{\dagger}\right] \tag{7}$$

where $a$ and $a^{\dagger}$ denote the time-independent lowering and raising operators (see
equation (4)). The temporal evolution of the quantum system can be treated in
terms of a scattering matrix $S$, defined as a unitary transformation connecting
asymptotic particle states $|\psi(\infty)\rangle = S|\psi(-\infty)\rangle$ before and after the interaction
(for the time-dependence of the electron wavefunction during near-field transit,
see ref. 5). This unitary operator $S$ is given by the time-ordered exponent

$$S = T\exp\!\left(-\frac{i}{\hbar}\int_{-\infty}^{\infty} H_{\mathrm{int}}(t)\,dt\right) \tag{8}$$


In the present case, the time-ordering $T$ can be omitted because $a(t)$ and $a^{\dagger}(t)$
commute. With the choice of a finite support $L$ of the basis states, the range of
integration should in principle be limited to a time interval of duration $L/v$, which
will cancel out the ratio $v/L$ appearing in equation (7) for the time-independent
contributions. The terms oscillating at higher frequencies ($\int\exp(\pm 2i\omega t)\,dt$ in
equation (7) and higher order contributions from equation (5)) vanish in the limit
of large $L$. This case of large $L$
($L \gg v/\omega$) corresponds to the experimental situation, in which the momentum
states are well-resolved with respect to their energy difference $\hbar\omega$, and therefore,
the passage to infinity can be carried out without loss of generality:

$$\frac{1}{\hbar}\int_{-\infty}^{\infty} H_{\mathrm{int}}(t)\,dt = \frac{g^{*}a - g\,a^{\dagger}}{i} \tag{9}$$

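Thus, the S-matrix in the interaction picture can be finally written as

$$S = \exp\!\left(g\,a^{\dagger} - g^{*}a\right) \tag{10}$$

and interestingly, the scattering matrix takes on the form of a displacement
operator.

The transition probabilities. Using the S-matrix, we can compute the probabilities
of the transitions $|0\rangle \to |N\rangle$, given by $P_N = |\langle N|S|0\rangle|^2$. For this purpose, we first
split the matrix exponent in equation (10) into a product of two exponents,
$\exp(g a^{\dagger} - g^{*}a) = \exp(g a^{\dagger})\exp(-g^{*}a)$. This separation is of course possible
because $a$ and $a^{\dagger}$ commute. Expanding the exponential operators in a Taylor
series, we find

$$\exp(g a^{\dagger})|0\rangle = \sum_{m=0}^{\infty}\frac{g^{m}}{m!}\,(a^{\dagger})^{m}|0\rangle = \sum_{m=0}^{\infty}\frac{g^{m}}{m!}\,|m\rangle \tag{11}$$

and analogously

$$\langle N|\exp(-g^{*}a) = \sum_{n=0}^{\infty}\frac{(-g^{*})^{n}}{n!}\,\langle N|a^{n} = \sum_{n=0}^{\infty}\frac{(-g^{*})^{n}}{n!}\,\langle N+n| \tag{12}$$

Using equation (10) and the orthogonality relation $\langle N+n|m\rangle = \delta_{N+n,m}$, we obtain

$$\langle N|S|0\rangle = \sum_{n=0}^{\infty}\frac{(-g^{*})^{n}\,g^{N+n}}{n!\,(N+n)!} = g^{N}\sum_{n=0}^{\infty}\frac{(-|g|^{2})^{n}}{n!\,(n+N)!} \tag{13}$$
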

Comparing this result with the following series expansion for the Bessel function 
of the first kind 


$$J_N(z) = \left(\frac{z}{2}\right)^{N}\sum_{n=0}^{\infty}\frac{(-z^{2}/4)^{n}}{n!\,(n+N)!} \tag{14}$$


we finally obtain 


$$P_N = J_N(2|g|)^{2} \tag{15}$$


Therefore, the probability of energy gain or loss is given in the form of Bessel 
functions of different order’. 
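The result of equations (13) to (15) can be checked numerically with a few lines of code. The sketch below evaluates the series of equation (13) for a real, positive coupling constant, compares it with Bessel's integral representation of $J_N$ as an independent reference, and confirms that the sideband populations sum to one. Function names, the truncation limits and the example value |g| = 5.7 (taken from the Fig. 3 caption) are choices made for this illustration.

```python
import math

def sideband_amplitude(N, g_abs, terms=80):
    """Series of equation (13) for real g = |g|; equals J_N(2|g|) by equation (14)."""
    n_abs = abs(N)
    term = g_abs ** n_abs / math.factorial(n_abs)   # n = 0 term of the series
    total = 0.0
    for n in range(terms):
        total += term
        term *= -(g_abs ** 2) / ((n + 1) * (n + 1 + n_abs))
    # Negative orders follow from J_{-N} = (-1)^N J_N; the sign drops out of P_N.
    return -total if (N < 0 and n_abs % 2 == 1) else total

def bessel_integral(N, z, steps=2000):
    """Reference value from J_N(z) = (1/pi) * int_0^pi cos(N*phi - z*sin(phi)) dphi."""
    h = math.pi / steps
    acc = 0.0
    for k in range(steps):
        phi = (k + 0.5) * h          # midpoint rule
        acc += math.cos(N * phi - z * math.sin(phi))
    return acc * h / math.pi

def sideband_probabilities(g_abs, n_max=40):
    """Populations P_N = J_N(2|g|)^2 for N = -n_max .. n_max."""
    return {N: sideband_amplitude(N, g_abs) ** 2 for N in range(-n_max, n_max + 1)}

probs = sideband_probabilities(5.7)
print("sum of P_N:", sum(probs.values()))
print("P_0:", probs[0], " P_1:", probs[1])
```

Sweeping `g_abs` and plotting `probs[N]` for a fixed order reproduces the oscillatory, Rabi-like dependence of each sideband population on the driving field.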

Propagation after interaction and Wigner function. The propagation of the elec-
tron wavefunction after interaction with the optical near-field can be described in
terms of a unitary evolution operator $\exp(-iH_0 t/\hbar)$, where $H_0$ is again the free-
electron Hamiltonian. Let $\psi_p(t)$ be the wavefunction in momentum representation
and $p$ the electron momentum in the laboratory frame. The unitary evolution is
then given by

$$\psi_p(t) = e^{-iE_p t/\hbar}\,\psi_p(0) \tag{16}$$

where $E_p = c\sqrt{(mc)^{2} + p^{2}}$ is the relativistic energy and $m$ is the electron rest mass.


In practice, the electron momenta $p$ after the interaction are all very close to the
initial (relativistic) electron momentum $\gamma m v$, where $v$ and $\gamma$ are the initial electron
velocity and the Lorentz contraction factor, respectively. For that reason, it is
convenient to use ‘shifted’ momenta defined as $p' = p - \gamma m v$.
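For orientation, the kinematic quantities entering these expressions can be evaluated for the 120 keV beam and the 800 nm near-field excitation quoted earlier in the Methods. The snippet is a minimal sketch; the constant values and variable names are assumptions of this illustration.

```python
import math

MC2_KEV = 510.999            # electron rest energy in keV
C = 299_792_458.0            # speed of light in m/s
M_E = 9.1093837015e-31       # electron rest mass in kg

def lorentz_factor(e_kin_kev):
    """gamma = 1 + E_kin / (m c^2)."""
    return 1.0 + e_kin_kev / MC2_KEV

def velocity(e_kin_kev):
    """Electron velocity in m/s from the kinetic energy."""
    gamma = lorentz_factor(e_kin_kev)
    return C * math.sqrt(1.0 - 1.0 / gamma ** 2)

gamma = lorentz_factor(120.0)
v = velocity(120.0)
p0 = gamma * M_E * v                       # initial momentum gamma*m*v in kg m/s
omega = 2.0 * math.pi * C / 800e-9         # angular frequency of the 800 nm field
delta_k = omega / v                        # wavenumber change per photon, in 1/m

print(f"gamma = {gamma:.4f}, v/c = {v / C:.4f}")
print(f"p0 = {p0:.3e} kg m/s, delta_k = {delta_k:.3e} 1/m")
```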

During the free propagation, the momentum distribution $|\psi_p(t)|^{2}$ remains
unchanged because the unitary action only changes phases of the probability
amplitudes $\psi_p(0)$. In contrast to the momentum distribution, the spatial density
distribution will vary in time during the propagation. In a ‘shifted’ laboratory
frame, the spatial representation of the wavefunction is given by the Fourier
transformation

$$\psi(z - vt, t) = \frac{1}{\sqrt{2\pi\hbar}}\int_{-\infty}^{\infty} e^{ip'z/\hbar}\,\psi_p(t)\,dp \tag{17}$$

where $\psi_p(t)$ is normalized to unity and $z - vt$ is the shifted spatial coordinate.
Figure 3a in the main text presents a computation of the probability density versus 
the arrival time of the wavepacket in a given plane as a function of the propagation 
distance between the interaction region and this plane. 
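The temporal focusing described here can be illustrated with a simplified pure-state model. The sketch below assumes a coherent sideband comb with amplitudes $J_N(2|g|)$, keeps only the quadratic (dispersive) part of the propagation phase, with the linear part absorbed into the arrival time, and ignores the finite pulse envelope and any incoherent averaging. The constants, function names and sampling choices are assumptions of this illustration, not the authors' simulation code.

```python
import cmath
import math

HBAR = 1.054571817e-34       # J s
M_E = 9.1093837015e-31       # kg
C = 299_792_458.0            # m/s
MC2_KEV = 510.999            # electron rest energy in keV

def bessel_j(N, z, terms=80):
    """J_N(z) from its power series, using J_{-N} = (-1)^N J_N."""
    n_abs = abs(N)
    term = (z / 2.0) ** n_abs / math.factorial(n_abs)
    total = 0.0
    for n in range(terms):
        total += term
        term *= -((z / 2.0) ** 2) / ((n + 1) * (n + 1 + n_abs))
    return -total if (N < 0 and n_abs % 2 == 1) else total

def density_profile(g_abs, distance_m, e_kin_kev=120.0, wavelength_m=800e-9,
                    n_max=30, samples=400):
    """Electron density over one optical period after free propagation."""
    gamma = 1.0 + e_kin_kev / MC2_KEV
    v = C * math.sqrt(1.0 - 1.0 / gamma ** 2)
    omega = 2.0 * math.pi * C / wavelength_m
    # Dispersive phase per squared sideband order, from d^2p/dE^2 = -1/(gamma^3 m v^3).
    theta = HBAR * omega ** 2 * distance_m / (2.0 * gamma ** 3 * M_E * v ** 3)
    amps = {N: bessel_j(N, 2.0 * g_abs) for N in range(-n_max, n_max + 1)}
    profile = []
    for k in range(samples):
        phi = 2.0 * math.pi * k / samples      # optical phase = omega * arrival time
        psi = sum(a * cmath.exp(1j * (N * phi - N * N * theta))
                  for N, a in amps.items())
        profile.append(abs(psi) ** 2)          # density relative to the unmodulated beam
    return profile

at_sample = density_profile(5.7, 0.0)          # directly after the interaction
at_focus = density_profile(5.7, 1.8e-3)        # 1.8 mm downstream, as in Fig. 3
print("max density at 0 mm:  ", max(at_sample))
print("max density at 1.8 mm:", max(at_focus))
```

Directly after the interaction the density is flat, because the near-field only imprints a phase; the density bunches into sharp spikes once the dispersive phase has accumulated over millimetre distances.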

The Wigner function of the quantum state (Fig. 3b in the main text) is given by 


$$W(z - vt, p, t) = \frac{1}{\pi\hbar}\int_{-\infty}^{\infty}\psi_{p+q}(t)\,\psi^{*}_{p-q}(t)\,e^{2iqz/\hbar}\,dq \tag{18}$$


It gives a phase space representation of the quantum state*’ and illustrates the 
propagation-induced pulse compression. 

Numerical calculations. In order to elucidate the importance of a spatially narrow
probing beam and a temporally stretched near-field excitation for the observation
of multilevel Rabi oscillations, we performed numerical calculations to quantita-
tively characterize the influence of an incoherent averaging over (temporally and
spatially) varying transition probabilities $P_N(\mathbf{r}, t)$, where $t$ is the electron arrival
time and $\mathbf{r}$ its position vector in the sample plane (perpendicular to the beam
direction). In a different context, that is, in the description of laser-electron cross-
correlations, similar computations were carried out in refs 5 and 6.

In a simplified geometry, the nanotip is modelled as a straight cylinder of a
radius corresponding to that of the tip at the probing position (r = 1 µm), for
which the scattered electric field can be analytically calculated within Mie theory.
The field enhancement at the surface of the tip shaft is about 1.4. In the approxi-
mated geometry, we obtain a Fourier amplitude of the scattered field of
approximately 80.3 V⁻¹ nm (normalized to the incident field F_inc), in the same order of
magnitude as in the experiments, and exponentially depending on the distance
to the surface with a radial decay length of approximately 90 nm.

In Extended Data Fig. 5a, we study the effect of spatial and temporal averaging 
on the visibility of the Rabi oscillations by averaging over a disk-shaped beam and a
Gaussian temporal distribution of different widths. The upper and lower graphs 
show the influence of a finite probing area and a reduction of the duration of 
sample excitation, respectively. Both for larger electron beam widths as well as 
shorter laser pulses driving the optical near-field, the Rabi oscillations exhibit 
weaker modulation and become substantially damped. Therefore, for the experi- 
mental electron pulse width of about 800 fs and a field decay of 90 nm, a probe 
radius around 10 nm and a near-field duration of 3.4 ps as in the experiments
(black lines) allow for the observation of strongly modulated Rabi oscillations. For 
these experimental parameters, the sideband populations closely follow the ana- 
lytical Bessel function dependence with minor deviations at higher fields (compare 
Extended Data Fig. 5b). 
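The effect of the finite probing area can be illustrated with a reduced version of this averaging. The sketch below assumes that the coupling constant decays exponentially with distance from the surface (decay length 90 nm, as quoted above) and averages $P_N = J_N(2|g|)^2$ over a uniformly filled disk; the temporal averaging over the laser pulse envelope is omitted, and all names and discretization choices are assumptions of this illustration.

```python
import math

def bessel_j(N, z, terms=80):
    """J_N(z) from its power series, using J_{-N} = (-1)^N J_N."""
    n_abs = abs(N)
    term = (z / 2.0) ** n_abs / math.factorial(n_abs)
    total = 0.0
    for n in range(terms):
        total += term
        term *= -((z / 2.0) ** 2) / ((n + 1) * (n + 1 + n_abs))
    return -total if (N < 0 and n_abs % 2 == 1) else total

def disk_averaged_population(N, g_centre, beam_radius_nm, decay_nm=90.0, steps=200):
    """Average J_N(2 g)^2 over a disk, with g = g_centre * exp(-x / decay_nm).

    x is the offset from the beam centre along the surface normal; each strip
    of the disk at offset x is weighted by its chord length.
    """
    h = 2.0 * beam_radius_nm / steps
    weighted = 0.0
    weight_sum = 0.0
    for k in range(steps):
        x = -beam_radius_nm + (k + 0.5) * h
        chord = math.sqrt(beam_radius_nm ** 2 - x ** 2)
        g_local = g_centre * math.exp(-x / decay_nm)
        weighted += chord * bessel_j(N, 2.0 * g_local) ** 2
        weight_sum += chord
    return weighted / weight_sum

# Near the first zero of J_1 (2g = 3.8317) the ideal N = 1 population vanishes;
# spatial averaging fills in this minimum and so reduces the modulation depth.
g_zero = 3.8317 / 2.0
for radius in (1.0, 10.0, 50.0):
    print(radius, "nm:", disk_averaged_population(1, g_zero, radius))
```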

As shown in the main text, the sinusoidal phase modulation of the electron 
wavefunction by the interaction with the optical near-field leads to the formation 
of an attosecond pulse train after a certain distance of free propagation behind the 
interaction region. In the experiments, the electron pulse consists of a partially 
coherent ensemble of electrons, and we investigate here the robustness of the 
attosecond pulse train generation to an incoherent averaging over different coup- 
ling constants g and wavefunction evolutions with fluctuating initial energies. We 
find that an initial kinetic energy spread below the photon energy is fully sufficient 
for the formation of a clear attosecond pulse structure. Specifically, Extended Data 
Fig. 6 presents evolution maps of the electron pulse structure as a function of 
propagation distance, incoherently averaging simulations of pure states with an 
initial kinetic energy width of 0.1 eV each. In Extended Data Fig. 6a, b, the electron 
density is incoherently averaged over a range of kinetic energies ΔE = 0.7 eV and
2.1 eV, respectively. At a range of 0.7 eV, the resulting electron density peak is
practically indistinguishable from the ideal case of a pure state with 0.1 eV width 
(solid black line in Extended Data Fig. 6e). With increasing spread of the inco- 
herent average, the peak begins to smear out, although its duration in the temporal 
focus is not notably enlarged even for a kinetic energy spread of 2.1 eV, three times 
larger than in the experiment. 

An incoherent average over different coupling constants g experienced by the 
electrons within the electron beam area (lower row) has a different effect: for a 
small probing radius of 10 nm and a decay length of the coupling constant as used 
above, the peak width is not affected, but the depth of the temporal focus is 
broadened (Extended Data Fig. 6c). Increasing the probing radius to 50 nm
(Extended Data Fig. 6d), that is, to a size substantially larger than in the experi- 
ment, the amplitude of the side lobes grows to ultimately affect the attosecond 
temporal resolution. In conclusion, the stability to perturbations in the coupling 
constant and the initial kinetic energy spread demonstrates that attosecond train 
generation will be observable under the given experimental conditions.

Extended Data Figure 1 | Experimental setup. Pulses from an amplified
femtosecond (fs) laser system, at bottom left, are split into two optical beams.
One of them is frequency-doubled in a β-barium borate (BBO) crystal and, after
separation from the fundamental beam, focused (lens with numerical aperture
0.015, 50 cm focal length) onto the tungsten needle emitter (W tip) for the
generation of electron probe pulses. The second beam (pump beam) is
temporally stretched, attenuated and focused (lens with numerical aperture
0.014, 20 cm focal length) onto the sample within the TEM (angle of incidence,
55°). Relative timing between the electron probe and laser pump pulse is
controlled by an optical delay stage. Optically-induced changes of the
population of electron momentum states are recorded with an electron energy
spectrometer. See Methods for details.

Extended Data Figure 2 | Spatial characterization of near-field scattering.
a, Raster scan of the optically-induced electron energy gain and loss prob-
ability, characterized by the spectral cutoff (top panel) and the sideband
populations of the zero loss peak (middle) and the second photon order
(bottom). The field-dependent electron energy spectra shown in Fig. 2 of
the main text were recorded at an x position indicated by the black line at the
tip surface. A slow sample drift results in a scanning artefact in the y direction
(jagged edge of the tip). For the results reported in the main text, a drift
correction in the y direction was applied (see Methods section ‘Data analysis
and drift correction’). b, TEM image of gold tip. Red rectangle, scanning
area displayed in a. c, Electron energy-loss spectra recorded along x = 0 with
varying distance from the tip surface.

Extended Data Figure 3 | Pulse characterization by electron-photon cross-
correlation. a, Differential electron energy-loss spectra as function of time
delay (zero loss peak of width 1.3 eV subtracted; the colour scale shows
the relative change of spectral density). b, Relative total scattering amplitude
as function of time delay (inset, relative shift of photon sidebands with
respect to zero loss peak). c, Energy- and time-resolved structure of the electron
pulse (the colour scale shows the normalized electron density).

Extended Data Figure 4 | Evaluation of sideband populations. Example
of electron energy spectrum (black dots) showing a number of photon
sidebands and a weak low-loss plasmon contribution. Lines show fitted
function used to extract sideband populations (blue) and the plasmon band
(orange).

Extended Data Figure 5 | Influence of spatial and temporal averaging.
a, Effect of electron beam size (top) and laser pulse duration (bottom) on
the visibility of the Rabi oscillations in the order |N| = 1. For increasing
electron beam size and decreasing laser pulse duration, the modulations are
strongly damped. The black curves correspond to the experimental situation.
b, Occupation probabilities of multiple spectral sidebands. Solid lines, Nth-
order Bessel functions. Dashed lines, numerical calculations accounting
for temporal and spatial averaging in the experiments.

Extended Data Figure 6 | Robustness of attosecond pulse train generation.
The influence of the electron beam’s initial energy spread and lateral size on
the temporal peak width of the generated pulse train are shown in the upper
and lower rows, respectively. a–d, Evolution of the electron density as a
function of propagation distance after the interaction with the optical near-
field, incoherently averaged over the initial kinetic energy distribution (a, b) or
the finite probing-area of the electron beam (c, d). A corresponding line profile
at the propagation distance where the electron density peaks are at their
maximum and form an attosecond pulse train is shown in e and f. For the
experimental parameters used in this work (energy spread ΔE = 0.7 eV FWHM
and electron beam radius r = 10 nm), the peak width remains nearly
unchanged as compared to the ideal (not averaged) case.




doi:10.1038/nature14400 


Global carbon export from the terrestrial biosphere controlled by erosion


Valier Galy¹, Bernhard Peucker-Ehrenbrink¹ & Timothy Eglinton¹,²


Riverine export of particulate organic carbon (POC) to the ocean 
affects the atmospheric carbon inventory over a broad range of 
timescales'°. On geological timescales, the balance between 
sequestration of POC from the terrestrial biosphere and oxidation 
of rock-derived (petrogenic) organic carbon sets the magnitude of 
the atmospheric carbon and oxygen reservoirs®’. Over shorter 
timescales, variations in the rate of exchange between carbon reser- 
voirs, such as soils and marine sediments, also modulate atmo- 
spheric carbon dioxide levels’. The respective fluxes of biospheric 
and petrogenic organic carbon are poorly constrained, however, 
and mechanisms controlling POC export have remained elusive, 
limiting our ability to predict POC fluxes quantitatively as a result 
of climatic or tectonic changes. Here we estimate biospheric and 
petrogenic POC fluxes for a suite of river systems representative of 
the natural variability in catchment properties. We show that 
export yields of both biospheric and petrogenic POC are positively 
related to the yield of suspended sediment, revealing that POC 
export is mostly controlled by physical erosion. Using a global 
compilation of gauged suspended sediment flux, we derive separate 
estimates of global biospheric and petrogenic POC fluxes of 
157+75 and 4371S! megatonnes of carbon per year, respectively. 
We find that biospheric POC export is primarily controlled by 
the capacity of rivers to mobilize and transport POC, and is largely 
insensitive to the magnitude of terrestrial primary production. 
Globally, physical erosion rates affect the rate of biospheric POC 
burial in marine sediments more strongly than carbon sequest- 
ration through silicate weathering. We conclude that burial of bio- 
spheric POC in marine sediments becomes the dominant long- 
term atmospheric carbon dioxide sink under enhanced physical 
erosion. 

The atmosphere is a small reservoir of carbon in comparison with 
rocks, soils, the biosphere and the ocean. Its size is therefore sensitive 
to small imbalances in the exchange of C with and between these larger 
reservoirs. Over long timescales, the continental biosphere is mostly at 
equilibrium with the atmosphere, because most C fixed by terrestrial 
photosynthesis is quickly returned to the atmospheric reservoir 
through respiration’. However, rivers deliver to the oceans a fraction 
of this net primary production (NPP) as POC and dissolved organic 
carbon (DOC)’*. Although most DOC is quickly returned to the 
atmosphere through oxidation in estuaries and the ocean, a significant 
fraction of riverine POC is buried in marine sediments and stored over 
long timescales. This ‘leakage’ of carbon from the biosphere-atmo- 
sphere loop represents a net sequestration of atmospheric C (ref. 6). 
Rivers also transfer POC from the rock reservoir (petrogenic organic 
carbon, OC) to marine sediments, thereby transferring C between two 
reservoirs disconnected from the atmosphere. During this transfer,
oxidation of petrogenic OC represents another leakage of C, in this
case towards the atmosphere. The nature and efficiency of riverine
export of POC to the ocean thus fundamentally affect the long-term
atmospheric C inventory. Despite its importance, global riverine
export of POC to the ocean has until now remained poorly constrained.
In particular, the respective global fluxes of biospheric
and petrogenic POC remain largely unconstrained. More importantly, 
the sensitivities and relative magnitudes of global biospheric and pet- 
rogenic POC export are not well defined, impeding our ability to 
quantitatively predict POC fluxes and their impact on the long-term 
global C cycle under different forcing scenarios. 

Developing accurate constraints on fluvial transfer of biospheric 
POC requires the quantification of, and correction for, petrogenic 
OC in river sediment. Although the presence of petrogenic OC in river 
sediments and river-dominated margin sediments has been inferred 
for decades, its direct and unambiguous detection is quite recent.
Consequently, few quantitative reconstructions of petrogenic OC 
fluxes exist. However, those reconstructions encompass such diverse 
river systems as the Amazon, Taiwanese rivers and the Ganges- 
Brahmaputra system. Radiocarbon (¹⁴C) measurements have provided
key constraints on petrogenic OC concentrations and fluxes.
Exploiting the absence of ¹⁴C in petrogenic OC and its presence in
biospheric OC, ¹⁴C measurements on riverine POC combined with
additional constraints on the chemical composition of either constituent
allow these two key constituents to be differentiated. Here we
use published POC compositional data (including ¹⁴C measurements)
to derive a direct, global estimate of the petrogenic OC flux (Methods). 
These new results, together with published petrogenic OC fluxes, pro- 
vide a unique compilation of riverine export of petrogenic and bio- 
spheric POC to the ocean from 43 river systems that account for 20% 
of the sediment discharge to the oceans. 

Petrogenic OC is an integral component of sedimentary and other 
rocks. Its export by river systems is therefore tightly linked to that of 
sediments. Indeed, both Komada et al. and Hilton et al. have shown
that yields of petrogenic POC (the petrogenic POC flux normalized to 
catchment area) are positively correlated with corresponding yields of 
suspended sediment in the Santa Clara River (California) and 
Taiwanese rivers, respectively, with the latter property serving as a 
measure of spatially averaged physical erosion rate. Our data set 
extends this observation to a broad range of river systems, covering 
more than four orders of magnitude in both catchment size and sedi- 
ment yield (Supplementary Table 1). These data broadly follow a 
power-law relationship characterized by a power exponent close to 
unity (1.11 ± 0.13) (errors are 1 s.d.; see Methods). Petrogenic POC
yield thus varies roughly linearly with sediment yield, implying that the 
behaviour of petrogenic OC during erosion and transport is similar to 
that of the other mineral phases (Fig. 1). In particular, this finding 
implies a generally uniform depth-distribution of petrogenic OC in 
soils and rocks. The average petrogenic POC concentration in river 
sediments, however, varies considerably (0.02% to 0.6%). This can be 
explained by variations in the average petrogenic OC content of rocks 
and/or by variable oxidation of petrogenic OC during sediment trans- 
fer to the ocean. Recent studies of large river basins with extensive 
floodplains (Ganges-Brahmaputra and Amazon) indicate that up to 
50% of petrogenic POC initially present in rocks can be oxidized during
sediment transport and temporary storage in intermediate reservoirs.
In contrast, petrogenic OC is very efficiently preserved in
fluvial systems characterized by rapid sediment transfer to the ocean,
such as Taiwanese rivers and the Eel River (California). Enhanced
oxidation of petrogenic OC could explain up to about 50% of the 
observed order-of-magnitude difference in petrogenic OC concentra- 
tion. In addition, average petrogenic OC concentrations in rocks have 
been reported to vary from catchment to catchment by at least an order 
of magnitude. Together, these observations suggest that whereas
initial contents in rocks and subsequent oxidation of petrogenic OC 
during sediment transport together are the dominant controls of pet- 
rogenic OC concentration in river sediments, sediment yield—that is, 
physical erosion rate—is the primary control on petrogenic OC export 
efficiency. 

Unlike petrogenic OC, biospheric POC is not an indigenous mineral 
component of the sediment; instead, it is added during vegetation 
growth, soil formation and processes associated with the movement 
of materials from source to sink (for example landsliding or overland 
flow). The controls on its behaviour could therefore differ from those 
affecting the mineral load. Because it is the other component of 
riverine POC, biospheric OC fluxes can be obtained by subtracting 
petrogenic OC contributions from riverine POC fluxes. Here we 
use a compilation of riverine POC fluxes from 70 river systems 
(Supplementary Tables 1 and 2), covering 42.7 × 10⁶ km² (that is,
40% of the total exorheic continental area) and accounting for about 
45% of the global freshwater discharge to the ocean. For 27 of these 
river systems we lack direct estimates of petrogenic OC contributions. 
We therefore use the relationship between sediment yield and petro- 
genic OC yield (Fig. 1) to estimate petrogenic contributions to POC in 
these systems (Methods). This permits the calculation of biospheric 
POC fluxes and yields for the entire set of 70 river systems. Our data 
show that biospheric POC yield is positively correlated with suspended 
sediment yield, following a unique power-law relationship (r² = 0.78)
(Fig. 2). Although relationships between riverine POC and suspended
sediment concentrations have been reported previously, we extend
this observation to biospheric POC, accounting for its dilution by 
petrogenic OC at high sediment yield. The singularity of the relation- 
ship between biospheric POC and sediment yield is remarkable, con- 
sidering the very broad range of climate, vegetation, geomorphology 
and anthropogenic disturbance characterizing the drainage basins 
considered. It implies that, globally, the rate of biospheric POC export 
is primarily controlled by sediment export processes. The exponent of 
the power relationship between biospheric POC and sediment yield is 
significantly smaller than 1 (0.56 ± 0.03), reflecting an increasing dilution
of biospheric POC by mineral phases (that is, decreasing biospheric
POC concentrations) at high sediment yield. This reflects
the well-documented decrease in OC concentration with depth in soil 
profiles, which results in an increase in POC stock with depth that 
globally follows a power law characterized by an exponent of 0.4 (ref. 
22). In general, at low sediment yield, erosion proceeds mostly by 
means of overland flow, exporting surface material (such as soil litter 
and O horizons) characterized by high biospheric POC concentrations.
Conversely, at high sediment yield, erosion proceeds by means
of deep gully erosion and/or landslides, thereby lowering the overall
biospheric POC concentration of the eroded material.
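
The power-law relationships discussed here are conventionally obtained by linear regression on log-transformed yields. The following is a minimal sketch of that procedure, not the authors' actual fitting code: the function name `fit_power_law` and the synthetic yields are hypothetical, and the data are generated from an exact power law (coefficient 0.081 as in the Fig. 2 caption, exponent 0.56 as quoted in the text) so that the fit recovers known parameters.

```python
import math


def fit_power_law(x, y):
    """Fit y = a * x**b by ordinary least squares on log10(x), log10(y)."""
    lx = [math.log10(v) for v in x]
    ly = [math.log10(v) for v in y]
    n = len(lx)
    mx = sum(lx) / n
    my = sum(ly) / n
    sxx = sum((u - mx) ** 2 for u in lx)
    syy = sum((v - my) ** 2 for v in ly)
    sxy = sum((u - mx) * (v - my) for u, v in zip(lx, ly))
    b = sxy / sxx                    # exponent = slope in log-log space
    a = 10 ** (my - b * mx)          # coefficient = 10**intercept
    r2 = sxy ** 2 / (sxx * syy)      # coefficient of determination
    return a, b, r2


# Hypothetical sediment yields (t km^-2 yr^-1) and biospheric POC yields
# (tC km^-2 yr^-1) generated from an exact power law for illustration.
sed_yield = [1.0, 10.0, 100.0, 1000.0, 10000.0]
bio_yield = [0.081 * s ** 0.56 for s in sed_yield]
a, b, r2 = fit_power_law(sed_yield, bio_yield)

# With an exponent below 1, the biospheric POC concentration of the
# sediment (yield ratio) falls as sediment yield rises: the dilution effect.
concentration = [yb / ys for yb, ys in zip(bio_yield, sed_yield)]
print(f"a = {a:.3f}, b = {b:.2f}, r2 = {r2:.2f}")
```

Real yields scatter around the trend, so r² would fall below 1 and the fitted parameters would carry uncertainties of the kind quoted in the text.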

To further evaluate to what extent productivity and associated soil 
OC content control biospheric POC export, we used the MOD17 
database to extract basin-scale estimates of NPP for 40 systems.
Biospheric OC yield and NPP are weakly positively correlated (power 
law with exponent of 1.16 and r = 0.30; Extended Data Fig. 1), sug- 
gesting that productivity does not impose a strong control on bio- 
spheric OC yield. However, the calculated fractions of the NPP 
exported from catchments by rivers vary by more than three orders 
of magnitude (0.01% to 2.1%) and are positively correlated with the 
suspended sediment yield, following a power-law relationship (Fig. 3) 
that is characterized by an exponent (0.50 ± 0.05) statistically identical
to that of the relationship defined by biospheric POC yield. We therefore
conclude that the rate of biospheric POC export depends primarily
on the capacity of rivers to mobilize and transport POC out of
catchments, rather than on POC production within the watershed.

Figure 1 | Relationship between petrogenic OC yield (Ypetro) and suspended
sediment yield (Ysed). Catchments larger (black diamonds) or smaller (grey
dots) than 100,000 km² plot on the same trend. Most of the variability in
petrogenic OC concentration derives from variable initial OC concentrations in
rocks and petrogenic OC oxidation during sediment transport. The regression
line is Ypetro ≈ 0.0007 Ysed^1.11; r² = 0.82; P < 0.001.

Our data set also reveals significant secondary variations (Figs 2 and 3), 
suggesting the existence of additional control mechanisms. Among 
the possible mechanisms, we postulate that, for a given sediment yield, 
higher frequency and/or deeper landslides result in decreased bio- 
spheric POC yield, illustrating the critical role of physical erosion 
processes in biospheric POC export. Other possible mechanisms 
include the sorption of DOC onto mineral phases in sediment-starved 
rivers (for example the Congo river) that are often characterized by 
high DOC concentrations. However, the lack of correlation between 
DOC concentration and biospheric POC yield suggests that sorption 
of DOC does not exert a significant control on the efficiency of bio- 
spheric POC export. The increasing relative variance of biospheric 
POC yield at a low concentration of suspended sediment—conditions 
that promote aquatic primary production—suggests that within-river 
biological productivity could explain part of the secondary variability 
of biospheric POC yield. Transient non-steady-state erosion of soil 
and biosphere reservoirs can also introduce some variability in bio- 
spheric POC yield that might not be accounted for by either sediment 
yield or NPP. Finally, human activities such as damming and agricul- 
ture currently affect virtually all modern river systems and probably 
influence biospheric POC yields. 

Figure 2 | Relationship between biospheric POC yield (Ybio) and suspended
sediment yield. Data obtained by subtracting measured petrogenic OC fluxes
from riverine POC fluxes (black dots) and those obtained using petrogenic OC
fluxes inferred from the relationship shown in Fig. 1 (grey dots) plot on the
same trend. The regression line is Ybio = 0.081 Ysed^0.56; r² = 0.78; P < 0.001.

The strong relationships between suspended sediment yield and
POC yield (both petrogenic and biospheric) allow global POC fluxes
to be inferred from the better-constrained global sediment flux. Here
we use recent estimates of the suspended sediment flux to the ocean
to estimate global biospheric and petrogenic POC fluxes. We use a
global suspended sediment flux of 19,000 ± 500 Mt yr⁻¹ and a corresponding
suspended sediment yield of 176 ± 8 t km⁻² yr⁻¹ (Methods).
In turn, using the overall relationship between suspended sediment
yield and biospheric POC yield (Fig. 2), we estimate a global biospheric
POC yield of approximately 1.46 tC km⁻² yr⁻¹, which translates into a
global biospheric POC flux of approximately 157 MtC yr⁻¹. A similar approach
applied to the relationship between suspended sediment yield and the
fraction of the NPP exported by rivers (Fig. 3) gives a global value of
0.18% of terrestrial NPP being exported to the ocean. Combined with
a mean estimate of the global terrestrial NPP of 77.6 GtC yr⁻¹ (ref.
24), this gives a global biospheric POC flux of approximately 140 MtC yr⁻¹ that
is statistically identical to our estimate based on biospheric POC
yield data. Annually, about 0.02% of the total mass of C present in
the atmosphere is thus transferred to the ocean as POC, showing
that biospheric POC sequestration can affect the size of the atmospheric
reservoir over timescales as short as 10³-10⁴ years.
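
The arithmetic behind these global estimates can be retraced in a few lines. The sketch below is only a consistency check under stated assumptions: it takes the sediment flux (19,000 Mt yr⁻¹), the exorheic area (108 × 10⁶ km², from Methods), the Fig. 2 regression (coefficient 0.081, exponent 0.56), the exported NPP fraction (0.18%) and the global NPP (77.6 GtC yr⁻¹) as quoted in the text. Function names are hypothetical, and uncertainties are ignored.

```python
def biospheric_flux_from_yield(sed_flux_mt, area_km2, coeff, exponent):
    """Return (sediment yield, biospheric POC yield, biospheric POC flux)."""
    sed_yield = sed_flux_mt * 1e6 / area_km2      # t km^-2 yr^-1
    bio_yield = coeff * sed_yield ** exponent     # tC km^-2 yr^-1
    bio_flux_mt = bio_yield * area_km2 / 1e6      # MtC yr^-1
    return sed_yield, bio_yield, bio_flux_mt


def biospheric_flux_from_npp(npp_gt, exported_fraction):
    """Biospheric POC flux (MtC yr^-1) as a fraction of global NPP."""
    return npp_gt * 1e3 * exported_fraction


# Route 1: sediment yield fed into the Fig. 2 power law.
sed_yield, bio_yield, bio_flux = biospheric_flux_from_yield(
    sed_flux_mt=19_000.0, area_km2=108e6, coeff=0.081, exponent=0.56
)

# Route 2: fraction of NPP exported (0.18%) times global NPP (77.6 GtC yr^-1).
npp_flux = biospheric_flux_from_npp(npp_gt=77.6, exported_fraction=0.0018)

print(f"sediment yield  : {sed_yield:.0f} t km^-2 yr^-1")
print(f"biospheric yield: {bio_yield:.2f} tC km^-2 yr^-1")
print(f"flux (yield)    : {bio_flux:.0f} MtC yr^-1")
print(f"flux (NPP)      : {npp_flux:.0f} MtC yr^-1")
```

Both routes land within rounding of the central values quoted above, which is the sense in which the two estimates agree.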

The dependence of petrogenic OC yield on OC concentrations 
in rocks complicates the estimation of the global petrogenic POC 
flux from our estimate of global suspended sediment yield. Ideally, 
the distribution of rocks characterized by variable OC concentra- 
tions as well as intrinsic geomorphic characteristics (such as the 
size of the floodplain and the spatial distribution of physical ero- 
sion) need to be taken into account, because they both exert strong 
control on petrogenic OC yields. In the absence of such a model, 
we can only assume that the 43 river systems that we characterized 
are representative of the natural variability (that is, rock types and 
catchment morphology). Using our global suspended sediment 
yield of 176 ± 8 t km⁻² yr⁻¹, we estimate a global petrogenic
POC flux of approximately 43 MtC yr⁻¹ (Methods).

Finally, we derive a combined global flux of terrestrial POC to the
ocean of approximately 200 MtC yr⁻¹, of which about 80% and 20% are biospheric
and petrogenic POC, respectively. This direct estimate of these
two fluxes provides an assessment of the magnitude of POC transfer
from the terrestrial biosphere to the ocean, and reveals the global
significance of petrogenic OC as a component of POC export by rivers
to the ocean. However, these fluxes do not take bedload transport into
account and must therefore represent a lower bound of actual petrogenic
and biospheric POC fluxes to the ocean. Indeed, bedload material
can be dominated either by petrogenic OC or biospheric POC,
implying that bedload transport contributes globally to the fluvial
export of both petrogenic and biospheric POC.

Figure 3 | Relationship between the proportion of NPP exported annually
(NPPexp) and suspended sediment yield. Normalization of biospheric POC
export to NPP does not remove its dependence on suspended sediment yield,
illustrating the overarching control exerted by physical erosion on biospheric
POC export. The regression line is NPPexp = 0.013 Ysed^0.50; r² = 0.71; P < 0.001.

On geological timescales, petrogenic and biospheric OC have 
opposing roles in the global C cycle: the net transfer of C between 
atmospheric and terrestrial reservoirs is set by the balance between 
petrogenic OC oxidation and biospheric POC burial. We show that the 
rate of both petrogenic and biospheric POC export from the conti- 
nents is controlled primarily by the rate of sediment export; that is, 
physical erosion. The preservation of both petrogenic and biospheric 
POC in the ocean is up to three times higher at high physical erosion 
rates. Thus, increased physical erosion rates favour efficient
transfer and burial of biospheric POC coupled with enhanced preser- 
vation (that is, decreased oxidation) of petrogenic POC, both acting to 
limit the return of carbon to the atmospheric reservoir. The small 
fraction of NPP exported even at very high erosion rates (a few per 
cent; Fig. 3) suggests that enhanced export of biospheric POC is sus- 
tainable over long timescales, as the terrestrial biospheric OC reservoir 
is continuously replenished by photosynthetic C fixation. Using avail- 
able estimates of terrestrial POC burial efficiency, we show that bio- 
spheric POC burial yield is positively correlated with sediment yield 
(Extended Data Fig. 2). Globally, the rate of C sequestration through 
silicate weathering has a weaker sensitivity to sediment yield 
(Extended Data Fig. 2) as a result of kinetic limitation of weathering 
reactions at high physical erosion rates. Biospheric POC burial is thus
predicted to become the dominant long-term atmospheric CO₂ sink
under a fourfold increase in global physical erosion rate at constant 
temperature. We conclude that tectonic and climatic forcing of phys- 
ical erosion favours biospheric POC sequestration over silicate weath- 
ering. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Quantitative apportionment of biospheric and petrogenic POC in river sedi- 
ments. Several methods have recently been used to apportion petrogenic and bio- 
spheric POC quantitatively in river sediments. All of these methods are based on 
the unique property of petrogenic OC that it does not contain any ¹⁴C, whereas
biospheric POC does. However, the ¹⁴C content of biospheric POC is difficult to
predict a priori, because it depends on a complex array of processes such as physical 
erosion, soil formation and the dynamics of the biosphere itself. Therefore, in most 
cases, additional constraints on the chemical composition of biospheric and/or 
petrogenic OC are needed. Galy et al. proposed that the different hydrodynamic
properties of petrogenic and biospheric POC in large streams and rivers enable the
use of bulk radiocarbon measurements on POC in suspended and bed sediments
collected along depth profiles to estimate both petrogenic OC concentration and
biospheric POC ¹⁴C content. The underlying assumption is that petrogenic OC is
uniformly distributed in the water column (owing to its physical properties and size 
distribution), whereas the relative concentration of biospheric POC—which is 
preferentially associated with fine-grained sediments—decreases predictably with 
depth. This approach has been tested and yields robust results for the Ganges-
Brahmaputra, Amazon and Fraser rivers. Here we reanalyse published ¹⁴C
data and show that the approach developed by Galy et al. for suspended sediment
depth profiles can also be applied to size-fractionated and density-fractionated
fluvial sediments as well as to suspended sediments sampled across a wide range
of flow conditions. The ¹⁴C content of density-fractionated sediments derived from
the Mississippi river and of time-series suspended sediments of the Ishikari river
provide excellent examples (Extended Data Figs 3 and 4). This type of binary
mixing approach is, however, not always appropriate, either because adequate data 
may not be available or because of non-systematic behaviour of biospheric POC in 
the water column. Indeed, biospheric POC often reflects the mixing of several 
components such as soil and fresh plant debris, which may associate with the 
mineral load in different ways and have different hydrodynamic properties. Here 
the Mackenzie river provides a good example. Recently fixed plant-derived bio- 
spheric OC and very old permafrost-derived biospheric OC are preferentially 
associated with coarse and fine sediments, respectively, undermining the binary 
mixing approach. In these circumstances, additional constraints on the composition
of the petrogenic and biospheric end-members are needed. These can be
obtained by using bulk (for example N/C, δ¹³C or δ¹⁵N) or compound-specific
(¹⁴C) data, as demonstrated for Taiwanese rivers, the Eel River and the
Mackenzie River. Once a priori compositions of the petrogenic and different
biospheric end-members have been established, simple mixing models can be used 
to apportion petrogenic and biospheric POC quantitatively. We used a combina- 
tion of the above-mentioned techniques to evaluate petrogenic OC concentrations 
in river sediments by using published and newly acquired POC characterizations. 
Supplementary Table 1 summarizes the sources of the data, the methods employed 
and the results obtained. 
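
The binary mixing approach described above can be written compactly. If petrogenic OC is ¹⁴C-free and present at a constant concentration OCpetro, and biospheric POC has a uniform modern fraction Fmbio, then modern OC (Fm × OC) = Fmbio × (OC − OCpetro). A linear fit of modern OC against total OC therefore gives Fmbio as the slope and OCpetro as the x-intercept. The sketch below illustrates this under those assumptions; it is not the authors' code, the function names are hypothetical, and the samples are synthetic, generated from the end-member values quoted in the Extended Data Fig. 3 caption (0.05% and 0.88).

```python
def fit_line(x, y):
    """Ordinary least-squares straight line; returns (slope, intercept)."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((u - mx) ** 2 for u in x)
    sxy = sum((u - mx) * (v - my) for u, v in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def apportion_petrogenic(oc_total, fm_bulk):
    """Estimate petrogenic OC content and biospheric Fm from bulk data.

    Assumes constant petrogenic OC content (no 14C) and a single
    biospheric end-member with uniform Fm across all samples.
    """
    modern_oc = [fm * oc for fm, oc in zip(fm_bulk, oc_total)]
    slope, intercept = fit_line(oc_total, modern_oc)
    fm_bio = slope                   # slope = Fm of biospheric POC
    oc_petro = -intercept / slope    # x-intercept = petrogenic OC content
    return oc_petro, fm_bio


# Synthetic samples (wt% OC), built so the answer is known in advance.
oc_total = [0.4, 0.8, 1.2, 2.0]
fm_bulk = [0.88 * (oc - 0.05) / oc for oc in oc_total]
oc_petro, fm_bio = apportion_petrogenic(oc_total, fm_bulk)
print(f"petrogenic OC = {oc_petro:.2f}%, biospheric Fm = {fm_bio:.2f}")
```

When biospheric POC is itself a mixture of end-members with different ages and hydrodynamic behaviour, as in the Mackenzie example, the samples no longer fall on a single line and this simple fit is not appropriate.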

In some cases, POC fluxes have been measured but adequate data to quantify 
petrogenic and biospheric POC are not available. In these cases (27 rivers; 
Supplementary Table 2) we estimate the petrogenic OC yield by using the rela- 
tionship between suspended sediment yield and petrogenic OC yield defined by all 
rivers for which we could quantify petrogenic and biospheric OC (43 rivers; Fig. 1 
and Supplementary Table 1). The significant scatter around this relationship 
introduces uncertainty in the estimation of petrogenic OC yield from suspended 
sediment yield. We therefore estimated the uncertainty of calculated petrogenic 
OC yield on the basis of the uncertainty of the relationship between suspended 
sediment yield and petrogenic OC yield. Biospheric POC yield is obtained by 
subtracting inferred petrogenic OC yield from riverine POC yield. The relation- 
ship between biospheric POC yield and suspended sediment yield is identical 
when the two groups of data—that is, data determined from geochemical char- 
acterization versus data inferred from the relationship between suspended sedi- 
ment yield and petrogenic OC yield—are considered separately (Fig. 2). 
Specifically, the exponents and multiplying terms of the two relationships are 
statistically identical (within ±1 s.d.): 0.51 ± 0.04 and 0.59 ± 0.09 for the exponents,
and −0.93 ± 0.13 and −1.19 ± 0.15 for the multiplying term. This shows
that inferring petrogenic OC yield from the relationship between suspended sedi- 
ment yield and petrogenic OC yield does not introduce a systematic bias in the 
determination of biospheric POC yield. 

Estimating global riverine petrogenic and biospheric POC fluxes. Robust relationships
between both petrogenic and biospheric POC yields and suspended
sediment yield enable the use of global suspended sediment yield to estimate global
petrogenic and biospheric POC fluxes. Global suspended sediment fluxes have
been the subject of extensive research over the past several decades. Suspended
sediment fluxes for individual gauged rivers have been compiled and used
to derive estimates of global suspended sediment fluxes. Usually, fluxes were first 
extrapolated regionally (for example by grouping rivers according to the oceanic 
basin they drain into) to account for regional differences in average suspended 
sediment yield (for example very high in small ocean islands versus small in the 
Russian Arctic). Global fluxes were then obtained by summing the regional fluxes 
for all areas draining into oceanic basins (in other words, endorheic systems are 
excluded from the global estimate). Peucker-Ehrenbrink, and Milliman and
Farnsworth, have provided the two most comprehensive recent compilations
of gauged suspended sediment fluxes. They estimated the global suspended sediment
flux to the ocean at 18,548 and 19,000 Mt yr⁻¹, respectively. Recently, Larsen
et al. estimated the global exorheic denudation flux at 19,000 Mt yr⁻¹, using an
empirical denudation model based on the strong relationship between denudation
rates and topography. Here we use a global suspended sediment flux of
19,000 ± 500 Mt yr⁻¹. Graham et al. estimated the global exorheic land surface
at 110 × 10⁶ km², whereas Syvitski et al. proposed a slightly lower estimate of
106 × 10⁶ km². The difference between these two estimates probably derives from
differences in corrections for endorheic drainage areas. Here we use an average
value of (108 ± 2) × 10⁶ km², resulting in an estimated global suspended sediment
yield of 176 ± 8 t km⁻² yr⁻¹. It should be noted that this value attempts to correct,
as far as possible, for recent damming of river basins, because pre-dam sediment
fluxes were used to estimate the global suspended sediment fluxes whenever
possible both by Peucker-Ehrenbrink et al. and by Milliman and Farnsworth.
Finally, we use the calculated value for global suspended sediment yield (176 ± 8 t
km⁻² yr⁻¹) and our relationships between petrogenic and biospheric POC yields
and suspended sediment yield to derive estimates of the global petrogenic and 
biospheric POC fluxes. To estimate the uncertainties associated with calculated 
global fluxes we first use Monte Carlo simulations (10,000 repetitions) to account 
for the uncertainty associated with the determination of the relationships between 
petrogenic and biospheric POC yields and suspended sediment yield. Then we 
propagate the uncertainty (±8 t km⁻² yr⁻¹) in the global suspended sediment
yield. Last, we compare the lower bound of the calculated global petrogenic and 
biospheric POC fluxes with the sum of all petrogenic and biospheric POC fluxes, 
respectively, and use the highest of the two values as the lower bound of our final 
estimate. 
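
The uncertainty propagation described above can be illustrated with a simplified Monte Carlo sketch. This is not the authors' procedure: it draws the regression parameters independently from normal distributions (ignoring any covariance between exponent and coefficient), and the spread assumed for the log₁₀ coefficient (0.10) is a hypothetical placeholder, because the text does not state it for the combined fit. The exponent (0.56 ± 0.03), coefficient (0.081), sediment yield (176 ± 8 t km⁻² yr⁻¹) and area (108 × 10⁶ km²) are taken from the text.

```python
import math
import random


def simulate_biospheric_flux(n_draws=10_000, seed=0):
    """Return the 16th, 50th and 84th percentiles of the simulated flux."""
    rng = random.Random(seed)          # fixed seed for reproducibility
    area_km2 = 108e6                   # exorheic land area, km^2
    fluxes = []
    for _ in range(n_draws):
        exponent = rng.gauss(0.56, 0.03)
        # Hypothetical 1 s.d. of 0.10 on the log10 coefficient (assumption).
        log10_coeff = rng.gauss(math.log10(0.081), 0.10)
        sed_yield = rng.gauss(176.0, 8.0)             # t km^-2 yr^-1
        bio_yield = 10 ** log10_coeff * sed_yield ** exponent
        fluxes.append(bio_yield * area_km2 / 1e6)     # MtC yr^-1
    fluxes.sort()

    def percentile(q):
        return fluxes[int(q * (n_draws - 1))]

    return percentile(0.16), percentile(0.50), percentile(0.84)


low, median, high = simulate_biospheric_flux()
print(f"flux = {median:.0f} (+{high - median:.0f} / -{median - low:.0f}) MtC yr^-1")
```

Because the flux depends on the parameters through a power law, the simulated distribution is skewed, which is why the resulting bounds are asymmetric around the median.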

Extended Data Figure 1 | Global relationship between biospheric POC yield and NPP. Basin-averaged NPP estimates were derived from the MOD17
database.

Extended Data Figure 2 | … sequestration yield and sediment yield. CO₂ sequestration through terrestrial
biospheric POC burial (black dots; Ycorg) is more sensitive to sediment yield
than CO₂ sequestration through silicate weathering (grey crosses; Ysil). At
high physical erosion rates (that is, high sediment yield), the burial of terrestrial …
dotted lines show CO₂ sequestration through terrestrial biospheric POC burial
for the entire set of biospheric POC export data (Fig. 2), assuming constant
burial efficiencies (BE) of 30 and 100%. CO₂ sequestration data through silicate
weathering are from Gaillardet et al. P = 0.001; r² = 0.80.

Extended Data Figure 3 | Organic carbon and radiocarbon contents of bulk
suspended sediments and grain size fractions in the Mississippi River.
Results are expressed as modern organic carbon (that is, the product of modern
fraction (Fm) and organic carbon content). The linear best fit gives the absolute
petrogenic OC content (0.05%) as well as the Fm of the biospheric POC (0.88).
Data from Wakeham et al. and Rosenheim et al. P = 0.001; r = 0.99.

Extended Data Figure 4 | Organic carbon and radiocarbon contents of bulk
suspended sediments from the Ishikari River, collected over a wide range of
flow regimes. Results are expressed as in Extended Data Fig. 3. The linear best
fit gives the absolute petrogenic OC content (0.54%) as well as the Fm of the
biospheric POC (1.00). Data from Alam et al. P = 0.001; r² = 0.99.

Multi-omics of permafrost, active layer and
thermokarst bog soil microbiomes

Jenni Hultman, Mark P. Waldrop, Rachel Mackelprang, Maude M. David, Jack McFarland, Steven J. Blazewicz,
Jennifer Harden, Merritt R. Turetsky, A. David McGuire, Manesh B. Shah, Nathan C. VerBerkmoes, Lang Ho Lee,
Kostas Mavrommatis & Janet K. Jansson


Over 20% of Earth’s terrestrial surface is underlain by permafrost 
with vast stores of carbon that, once thawed, may represent the largest 
future transfer of carbon from the biosphere to the atmosphere. This
process is largely dependent on microbial responses, but we know 
little about microbial activity in intact, let alone in thawing, per- 
mafrost. Molecular approaches have recently revealed the identities 
and functional gene composition of microorganisms in some permafrost
soils and a rapid shift in functional gene composition during
short-term thaw experiments. However, the fate of permafrost carbon
depends on climatic, hydrological and microbial responses to
thaw at decadal scales. Here we use the combination of several
molecular ‘omics’ approaches to determine the phylogenetic com- 
position of the microbial communities, including several draft gen- 
omes of novel species, their functional potential and activity in soils 
representing different states of thaw: intact permafrost, seasonally 
thawed active layer and thermokarst bog. The multi-omics strategy 
reveals a good correlation of process rates to omics data for domi- 
nant processes, such as methanogenesis in the bog, as well as novel 
survival strategies for potentially active microbes in permafrost. 

We collected replicate permafrost soil cores (including the seasonally 
thawed active layer) from a permafrost plateau and an adjacent young 
thermokarst bog from a site in interior Alaska that represents future 
anticipated ecosystem transformations as permafrost thaws. Owing to
the recognized difficulty in cultivating the majority of microorganisms
from soil, we combined several culture-independent ‘omics’ approaches:
targeted 16S rRNA gene sequencing (16S) to determine the microbial 
community composition; total metagenomic DNA sequencing (MG) 
to determine the complement of phylogenetic and functional genes; total 
metatranscriptome RNA sequencing (MT) to determine which genes 
were expressed; and shotgun mass spectrometry-based metaproteomics 
(MP) to determine which proteins were produced. Together these ana- 
lyses resulted in a large amount of data including 84.2 gigabases (Gb) of 
MG sequence, 20.4 Gb of MT sequence and approximately 7,000 proteins,
which are among the highest yields obtained for any soil type so
far. We also compared the different molecular approaches with
each other and with measured process rates. We analysed relative differences
in gene expression among samples on the basis of the ratio of
functional gene transcripts to genes (MT/MG). 
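
The MT/MG ratio can be illustrated with a short sketch. It assumes the ratio is computed from relative abundances, that is, each count is first normalized to the total of its own data set so that sequencing depth cancels out. The exact normalization used by the authors is not stated in this passage, so this is an assumption, and the function name and counts below are hypothetical.

```python
def mt_mg_ratios(mg_counts, mt_counts):
    """Ratio of relative abundance in transcripts (MT) to genes (MG)."""
    mg_total = sum(mg_counts.values())
    mt_total = sum(mt_counts.values())
    ratios = {}
    for taxon, mg in mg_counts.items():
        if mg == 0:
            continue  # ratio undefined when absent from the metagenome
        mg_fraction = mg / mg_total
        mt_fraction = mt_counts.get(taxon, 0) / mt_total
        ratios[taxon] = mt_fraction / mg_fraction
    return ratios


# Hypothetical read counts per phylum, for illustration only.
mg = {"Proteobacteria": 200, "Chloroflexi": 500, "Actinobacteria": 300}
mt = {"Proteobacteria": 340, "Chloroflexi": 400, "Actinobacteria": 260}
ratios = mt_mg_ratios(mg, mt)
for taxon, ratio in sorted(ratios.items(), key=lambda kv: -kv[1]):
    print(f"{taxon}: MT/MG = {ratio:.2f}")
```

A ratio above 1.0 indicates that a group contributes proportionally more transcripts than genes, which is read here as a sign of relatively higher activity.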

The permafrost, active layer and thermokarst bog soils had a unique 
complement of genes, transcripts and proteins (Extended Data Fig. 1). 
Bacteria and Archaea dominated the data, with only a small fraction corresponding
to fungi (MG 0.07 ± 0.04%, MT 0.41 ± 0.47%, zero in MP),
so we omitted fungi from further analyses. There was greater overlap
in shared genes between the soils, compared with transcripts and proteins
(Extended Data Figs 1 and 2). Permafrost had fewer and more
unique transcripts and proteins than active layer and thermokarst bog 
soils (Extended Data Figs 1 and 2). By contrast, many transcripts were 
shared between the bog and active layer, including many transporters 
(Extended Data Figs 1d and 3). 

The most abundant bacteria found in the permafrost soil on the basis 
of 16S were members of Chloroflexi (19.2 ± 6.3% s.d.), Proteobacteria
(17.9 ± 1.3%) and Actinobacteria (12.5 ± 3.1%) (Fig. 1a). Similar distributions
were observed in 16S, MG and MT data sets (correlation
coefficient r > 0.9 for 16S:MG and MG:MT; r > 0.8 for 16S:MT,
Fig. 1b). However, some differences were found, such as higher representation
of Actinobacteria in MG than 16S (27% and 12%, respectively;
P = 0.007), partly explained by biases in the 16S approach.
Also, some phyla were more or less represented in the MT than the 16S 
and MG data, and we propose that these differences can be used to pre- 
dict their relative activity at the time the samples were collected. In per- 
mafrost, the MT/MG ratios were highest for Proteobacteria (MT/MG 
= 1.7), Acidobacteria (MT/MG = 1.4), and Firmicutes (MT/MG = 1.6), 
suggesting that these representatives were acclimated to be active in 
subzero temperatures (Fig. 1b). Also, binning of MG assemblies from 
permafrost resulted in a draft genome of a novel Acidobacterium (PF_400,
Extended Data Table 1) with closest sequence similarity to iron-reducing
Acidimicrobium ferrooxidans (Genome-to-Genome Distance
Calculator DNA-DNA hybridization (DDH) 19.60% ± 2.30). The draft
genome contained many interesting features including genes involved 
in Fe(III) transport, and cytochromes putatively involved in iron uptake 
and reduction (Extended Data Table 1). The genetic capacity for iron 
reduction in permafrost correlated well with dissimilatory Fe(III) re- 
duction rates measured at the site (Table 1). 

The active layer exhibited greater species and functional diversity 
than the other two soils (Fig. 1a), consistent with previous findings. 
Members of the Actinobacteria, Acidobacteria and Proteobacteria phyla 
had the highest MT/MG ratios (Fig. 1b), suggesting that they were 
among the most active members in the seasonally thawed soil. A draft 
genome of a novel member of one of the Proteobacteria (AL_334) was 
also obtained from the active layer MG (Extended Data Table 1). Func- 
tional screening of the MG and MT data revealed many genes involved 
in specific biogeochemical cycles, including those for denitrification, 
nitrate reduction, iron reduction and methane oxidation (see Supplemen- 
tary Information for brief overview), correlating with process measure- 
ments at the site (Table 1). 

Figure 1 | Microbial phylogenetic composition in permafrost, active layer and thermokarst 
bog soils. a, Relative abundance of phyla on the 
basis of targeted 16S rRNA gene sequencing (16S), 
16S rRNA gene reads from metagenomes (MG) 
and from metatranscriptomes (MT). The data 
represent the average of four replicates for 16S, two 
for MG and two for MT, except for permafrost 
MT which was obtained from only one sample 
owing to low RNA yield. b, Ratios of average 
relative abundance of ten abundant phyla in the 
metatranscriptomes relative to the metagenomes 
(MT/MG). Blue, permafrost; green, active layer; 
pink, bog. Bars marked with an asterisk are 
significantly different (two-tailed t-test, a P value of 
0.05 was used as the significance level) from 
MT/MG = 1.0, to highlight potentially relatively 
higher activity. 

In the thermokarst bog, the combined omics data correctly predicted 
that methanogenesis was predominant where high rates of methane 
production were observed (Table 1 and Fig. 2). Several genes involved 
in methanogenesis (mcrABG) were detected in the MG and MT data 
sets. The MT/MG ratios for methanogenesis were higher in the bog than 
any other site (Fig. 2). Also, the Euryarchaea had high MT/MG ratios of 
1.69 ± 1.14 (Fig. 1b), and methanogen 16S rRNA gene sequences, transcripts 
and proteins were highly abundant (Figs 1-3). Of all the 16S sequences, 
6.8-10.5% were assigned to Methanosarcina, a metabolically 


diverse methanogen that can produce methane using a variety of metabolic 
routes, including the acetoclastic pathway (Fig. 1a and Extended 
Data Fig. 4). We also binned three draft methanogen genomes from the 
bog MG (Extended Data Table 1), with the largest bin, Bog_15, con- 
taining genes for methane production via the hydrogenotrophic route, 
several of which were expressed and detected in the MT and the MP as 
well (Extended Data Table 1). A methanogen genome binned from an 
arctic wetland MG, Methanoflorens stordalenmirensis, was recently reported, 
with low sequence similarity to those we found here (DDH 


Table 1 | Soil characteristics and process rates from the permafrost, active layer and thermokarst bog 

                                            Permafrost                 Active layer               Bog 
Plant community                             Black spruce/feathermoss   Black spruce/feathermoss   Sphagnum moss 
Soil dissolved organic carbon (mg g⁻¹)      0.08 ± 0.01†               1.05 ± 0.18*               0.95 ± 0.21* 
Soil total dissolved nitrogen (mg g⁻¹)      0.02 ± 0.002†              0.08 ± 0.01*               0.07 ± 0.01* 
pH                                          5.76 ± 0.16*               4.69 ± 0.10†               4.88 ± 0.02† 
Aerobic respiration (µg C h⁻¹ g⁻¹)          0.09 ± 0.01†               0.09 ± 0.01†               1.61 ± 0.40* 
Anaerobic respiration (µg C h⁻¹ g⁻¹)        0.02 ± 0.002†              0.03 ± 0.01†               1.21 ± 0.27* 
Methanogenesis (ng C h⁻¹ g⁻¹)               ND                         ND                         454 ± 150 
Denitrification (ng N2O h⁻¹ g⁻¹)            1.0 ± 0.7†                 3.0 ± 0.8*†                6.4 ± 1.4* 
Nitrate reduction (ng NO3 h⁻¹ g⁻¹)          0.19 ± 0.10*               0.40 ± 0.08*               132 ± 36† 
Fe reduction (ng Fe h⁻¹ g⁻¹)                10.0 ± 8.0†                26.2 ± 3.1*                ND 
Sulphate reduction (ng SO4 h⁻¹ g⁻¹)         2.9 ± 0.7†                 4.2 ± 0.6*†                7.1 ± 1.7* 
Aerobic methane oxidation (ng C h⁻¹ g⁻¹)    NA                         200 ± 150                  700 ± 400 
Moisture (%)                                58.2 ± 3.3†                81.4 ± 2.7*                $3.5 = 1.27 
Electrical conductivity (µSiemens)          379 ± 53*                  272 ± 19*                  242 ± 13* 

Units are means ± 1 s.e.m. ND, not detected; NA, not determined. Footnote symbols (*, †) denote significant differences among soils (P < 0.05). 

Figure 2 | Average MT/MG ratios. a, Permafrost; b, active layer; 
c, thermokarst bog soils for two replicate cores. d, Comparison of MT/MG on 
the same plot. Error bars, s.e.m. Amm, ammonia uptake; Denit, denitrification; 
Met, methanogenesis; Mox, methane oxidation; Nif, nitrogen fixation; Sul, 
sulphate reduction. 


20.60-37.30%). Closest sequenced relatives to our draft genomes were 
Methanosarcina acetivorans (Bog_301, 69.70% ± 2.92) and Methanosaeta 
concilii (with Bog_232 46.60% ± 2.58 and Bog_15 42.70 ± 2.53 
DDH similarity (Extended Data Table 1)); thus they represent previously 
undescribed methanogen genomes. 

Two novel Chloroflexi draft genomes were also binned from the bog 
MG (Extended Data Table 1). One of them, Bog_440, with closest similarity 
to a representative of the genus Anaerolinea (A. thermophila, 
DDH 15.20% ± 2.14), was phylogenetically similar to a dominant operational 
taxonomic unit (OTU) in the 16S data (Fig. 1a and SOGA31 in 
Extended Data Fig. 5) and has previously been reported as abundant in 

Figure 3 | Relationships among MG, MT and MP for nitrogen cycling 
(circle), sulphur cycling (square) and methanogenesis (triangle) in 
permafrost (blue), active layer (green) and thermokarst bog soils (pink). 
Points represent the sum of all data involved in the particular process. 

other arctic samples on the basis of 16S, but represents an undersampled 
phylum with few isolates and none with sequence similarity 
to those we found here. Draft Bog_440 possessed some genes involved 
in sulphate reduction and iron utilization (Extended Data Table 1), 
suggesting that it may play a role in these processes. 

The metaproteomics data enabled us to detect the dominant micro- 
bial proteins actually produced and/or stabilized in the soil. Most of the 
proteins matched to the metagenomes and many were ‘uncharacterized’ 
or ‘hypothetical’ (Table 2 and Supplementary Table 1), or predicted 
to be involved in housekeeping functions (Extended Data Fig. 1 and 
Fig. 4). In permafrost, we detected a relatively high representation of 
cold-shock proteins, presumably for survival under frozen conditions 
(Fig. 4a and Supplementary Table 1). Surprisingly, proteins involved in 
chemotaxis and motility were also observed in permafrost (Supplemen- 
tary Table 1), suggesting that some members of the community had the 
capacity for motility. Fewer transporters were detected in permafrost 
than active layer and bog soils (Fig. 4a and Supplementary Table 1), as 
also reflected by lower levels of transporter transcripts in permafrost 
(Extended Data Fig. 3). Several cold-shock proteins 
were also detected in the active layer (Fig. 4b), reflecting the need for 
the community to cope with not only the seasonal shifts in temperatures, 
but also exposure to extreme freezing conditions. Other abundant pro- 
teins in the active layer included several transporters (Extended Data 
Fig. 1, Fig. 4b and Supplementary Table 1), indicative of greater poten- 
tial for microbial transport of nutrients in the thawed soils (Extended 
Data Figs 1b, d and 3). Compared with the permafrost and active layer, the 
bog soils contained fewer proteins linked to cold tolerance and other 
stress-related reactions (7%, 14% and 12% in bog, active layer and 
permafrost, respectively) (Supplementary Table 1). 

In addition to proteome searches against the matched metagenomes, 
we also performed searches against microbial genomes: 180 environmental 
isolates, plus an additional 13 recently sequenced isolates from cold 
environments (Supplementary Table 3). Rhodoferax ferrireducens had 
many (58) protein matches in permafrost (Table 2 and Supplementary 
Table 1) although it was relatively rare in the MT (<0.001%), MG 
(<0.005%) and 16S sequences (0.1%). Several proteins were also assigned 
to some betaproteobacteria, nitrifiers, acidobacteria, facultative methy- 
lotrophs (Fig. 4a and Supplementary Table 1) and to several psychro- 
phile isolates (Fig. 4a and Extended Data Table 2). More proteins were 
assigned to a wider range of the reference genomes in the active layer 
than permafrost soil (Table 2), indicating a larger diversity of active 
species in this soil, including several Proteobacteria, in addition to 11 of 


Table 2 | Twenty most abundant hits of metaproteome data to genomes/metagenomes 


Reference metagenome/genome Permafrost Active layer Bog 
Permafrost metagenome 41.6+3.3 32.347.4 21.1+12.3 
Active layer metagenome 54.6 + 37.4 50.5+8.2 21.8+ 22.5 
Bog metagenome 38.4+0.3 466+48 73.3+10.7 
Bradyrhizobium japonicum USDA 110 2641.1 6342.1 2642.1 
Burkholderia xenovorans LB400 242.3 3.042.171 01401 
Arthroderma benhamiae CBS 112371 0.3 +04 2.843.2 2.3+2.7 
Rhodopseudomonas palustris CGA009 10+03 24414 1.52415 
Nitrobacter winogradskyi Nb-255 18208 2.2+0.9 1.0+09 
Burkholderia cepacia 383 15217 2021.1 Q120.1 
Delftia acidovorans SPH-1 14419 1941.7 04+0.6 
Methanosaeta concilii GP-6 0.1 +0.0 0.0+0.0 2.7+2.3 
Methanosarcina mazei Gol, DSM 3647 0.0 + 0.0 0.0 + 0.0 2.0+ 0.4 
Solibacter usitatus Ellin6076 1321.1 6+09 1.5+1.0 
Nitrobacter sp. Nb-311A 0.9+0.2 7413 1.1+0.9 
Rhodoferax ferrireducens T118 4.0+ 2.6 7+09 0.5+0.6 
Acidovorax sp. JS42 1.9+1.6 6+1.0 0.4+0.5 
Ensifer medicae WSM419 1922.5 04+04 0.4+0.0 
Ralstonia solanacearum GMI1000 18+1.6 8+08 03+0.2 
Methylobacterium nodulans ORS 2060 L7#12 240.2 1.0203 
The ten most abundant database hits are highlighted in bold for each soil. Each metagenome data set 
(first three rows) was obtained from the same source sample as the metaproteomes. Values are based on the 
average normalized spectral abundance factor (NSAF) values and standard deviation of four replicate 
Velos/Orbitrap mass spectrometer runs for each soil. 
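
For readers unfamiliar with the NSAF values reported in Table 2, the following is a minimal sketch of the commonly used definition: each protein's spectral count is divided by its length, and the result is normalized by the sum over all proteins in the run. The protein names, counts and lengths below are hypothetical, and the exact normalization and scaling applied to Table 2 are not specified here, so this is an illustration under stated assumptions rather than the pipeline behind the table.

```python
def nsaf(spectral_counts, lengths):
    """Normalized spectral abundance factor for each protein in one run.

    spectral_counts: dict mapping protein id -> spectral count
    lengths: dict mapping protein id -> protein length (amino acids)
    """
    saf = {p: spectral_counts[p] / lengths[p] for p in spectral_counts}
    total = sum(saf.values())
    if total == 0:
        raise ValueError("no spectra were assigned; NSAF is undefined")
    return {p: value / total for p, value in saf.items()}


# Hypothetical proteins from a single mass spectrometer run.
counts = {"protein_A": 10, "protein_B": 20, "protein_C": 10}
lengths = {"protein_A": 100, "protein_B": 400, "protein_C": 200}

nsaf_values = nsaf(counts, lengths)
for protein, value in nsaf_values.items():
    print(protein, round(value, 3))
```

Averaging such per-run values over the four replicate runs, and taking their standard deviation, would give summary figures of the kind the footnote describes.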

Figure 4 | Visualization of proteins identified 
from metaproteomics data sets. a, Permafrost; 
b, active layer; c, thermokarst bog soils. The 
proteins were identified by searching against 
databases comprising matched metagenomes from 

the R. ferrireducens proteins mentioned above (Supplementary Table 1), 
and to some of the psychrophile isolates with the majority matching 
to Rhodonellum psychrophilum and Winogradskyella psychrotolerans 
(Supplementary Table 3). In the bog, several proteins matched to meth- 
anogens, including Methanosarcina mazei, Methanosaeta concilii, Methy- 
lobacterium nodulans, Methanoflorens stordalenmirensis (Table 2, 
Supplementary Table 3 and Fig. 4c) and to our draft methanogen ge- 
nomes, reflecting the high rates of methanogenesis in the bog. These 
included proteins involved in methanogenesis, nitrogen metabolism, 
iron transport and tolerance of cold and redox induced stress (Fig. 4c). 
As in the other samples, many proteins also matched to some of the 
psychrophile genomes (Supplementary Table 3). 

Comparison of the MG, MT and MP data from the three soils pro- 
vides insights into the linkages between omics data and processes. Pro- 
cesses that were detected in each of the three omics data sets, namely 
nitrogen, sulphur and methane cycling pathways, are shown in Fig. 3 
(see Supplementary Information for details). The results indicate that 
permafrost microbial communities have a lower general functional 
potential than thawed soils, but also provide insights into adaptation 
strategies for life in frozen conditions. We found that global molecular 
data were a poorer predictor of biogeochemical process rates in perma- 
frost than the other soils, probably owing, in part, to the frozen state of 
the soil. All measured process rates were generally low in frozen permafrost, 
but relative abundances of transcripts and proteins for 
some specific pathways, such as methane oxidation, 
were nonetheless similar to those for the active layer. It is possible 
that some of the RNA and proteins were preserved for long periods in 
this frozen environment, or that they were maintained in microbes that 
are surviving in a dormant state. Alternatively, some microbes in permafrost 
may be actively expressing these genes and translating them into 
proteins in subzero conditions. In particular, dissimilatory Fe(III) re- 
duction emerged as a potential metabolic strategy used by microbes in 
permafrost, as supported by combined omics data and process rates. By 
comparison, the active layer communities expressed genes and proteins 
involved in obtaining energy and nutrients from a diversity of aerobic 
and anaerobic processes and were equipped with functions for survival 
under freeze-thaw conditions. The bog represented a different scenario 
with a very high measured rate of methanogenesis and correspondingly 
high relative abundances of genes, transcripts and proteins involved 
in methanogenesis, thus demonstrating the potential linkage between 
molecular data and ecosystem level process rates, particularly when the 
process is dominant. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS 


No statistical methods were used to predetermine sample size. 

Site description. The sites used in this experiment are part of the Alaska Peatland 
Experiment (APEX), which is part of the Bonanza Creek Long-Term Ecological 
Research Program just outside Fairbanks, Alaska (64.70° N, −148.3° W). The forested 
site is underlain by permafrost, is dominated by black spruce (Picea mariana) 
and has an understorey composed of Vaccinium uliginosum, Vaccinium vitis-idaea, 
Calamagrostis canadensis and occasionally Eriophorum vaginatum, Larix laricina 
and Salix bebbiana. Understorey non-vascular vegetation is composed of moss species 
Hylocomium splendens, Sphagnum sp., Dicranum sp. and lichen. 

The thermokarst bog site is primarily composed of Sphagnum mosses, as well as 
vascular species Carex aquatilis, Eriophorum chamissonis, Chamaedaphne calycu- 
lata and Carex chordorrhiza, and contains recently deceased black spruce trees. 

During the time of sample collection, the temperature of the surface permafrost 
was just below freezing, much like most of the surface permafrost in interior Alaska 
(60 cm: maximum −0.052 °C, minimum −2.18 °C). In the thermokarst bog, the 
soil temperature ranged from 3.49 to −0.08 °C at 100 cm and from 6.6 to −2.69 °C at 50 cm. 
The pH at the sites was 4.8 in the active layer, 4.9 in the bog and 5.8 in permafrost (1:1 soil:water 
ratio). 
Sample collection. Samples of the active layer and permafrost (black spruce site) 
were taken in summer 2009 (for DNA extraction, four cores per soil) and summer 
2012 (for process rate measurements) by using a SIPRE corer that produces a 7.5 cm 
diameter core down to 1 m. Maximum active layer depth was measured at 55 cm at 
this site. Samples of the active layer were at approximately 30-35 cm depth and of 
permafrost were 65-75 cm depth. 

The thermokarst site underwent a thaw approximately 100 years ago, as supported 
by aerial photograph interpretation (Turetsky) and radiocarbon analysis of the transition 
between permafrost and thermokarst soils. Cores (for DNA extraction, four 
cores) were taken using a sharpened polyvinyl chloride (PVC) pipe of diameter 
4 feet (~1.22 m) and by cutting around the bottom of the PVC pipe as it was slowly 
inserted into the bog, without compacting the organic materials. When the PVC 
pipe was inserted as far as it could go (approximately 1.2 m), the bottom of the PVC 
tube was covered with a PVC cap and the core was slowly raised out of the ground. 
The PVC core was quickly wrapped in a plastic bag, taped tightly and placed into a 
cooler with dry ice to be immediately frozen. Samples for microbial analysis were 
taken at approximately 100 cm depth. Samples for process rate measurements were 
collected in April 2011 using a SIPRE corer. In the field, cores were wrapped in 
aluminium foil and packed in dry ice for transport to our laboratory facilities, where 
they were kept frozen pending subsampling. Process measurements were conducted 
on core splits from about 45-60 cm below the surface. 

Sample processing. For incubation studies, frozen soils were sectioned using a com- 
pound mitre saw. Before sectioning, cores were scraped of surface organic matter 
and other contaminants that froze, or otherwise adhered to the core during the 
drilling and extraction process. Frozen soil or peat aliquots were split from each 
core section for downstream analyses using a band saw to divide sections along the 
vertical profile. Core splits were thawed in a glove box under anaerobic conditions 
(1-2% H2 by volume, balanced with N2) and homogenized by hand. Core 
splits were allocated either to standard soil analyses, for example moisture, pH and 
total CN content, or to incubation units for sequential determination of Fe(III)/Fe(II), 
SO4²⁻, NO3⁻, denitrification potential, as well as trace gas production (CO2, CH4 and 
N2O) by aerobic and anaerobic pathways. 

Soil analyses. Soil moisture was determined gravimetrically by drying samples to a 
constant mass at 65 °C. These subsamples were subsequently powdered in a modified 
roller mill and used to measure soil organic carbon and total nitrogen by com- 
bustion analysis on a LECO 2000 CNS Autoanalyzer calibrated with National Insti- 
tute of Standards and Technology standards Buffalo River sediment and apple leaves 
(SRM 2704 and SRM 1515, respectively). Owing to the acidity of these samples, 
carbonate removal was not necessary before analysis. For gamma samples, soil pH 
was determined from a 1:2 (mass:volume) slurry of oven-dried soils and deionized 
water which was allowed to equilibrate at room temperature for 30 min. For beta 
samples, we measured pH on pore water collected in July using acid-washed (30 cm 
× 6.0 mm internal diameter), stainless steel sediment pore water samplers connected 
to 60 ml syringes fitted with three-way Luer-lock stopcocks. The samplers 
were flushed with several volumes of pore water before collecting and filtering 
(0.45 µm nylon) samples into sterile 50 ml polyethylene centrifuge tubes. 

Rates of iron reduction, and sulphate and nitrate utilization. We established 
two series of six incubation units representing six sampling periods (0, 1, 3, 5, 8 and 
12 d) for each of the forest and permafrost samples, and two series of four incubation 
units (0, 1, 5, 12 d) for each bog soil core. An additional incubation unit consisting 
of autoclaved soil was established for each bog core section, and harvested at 
12 d. Incubation units consisted of about 5 g of homogenized whole soil or peat, 
subsampled under anaerobic conditions into 40 ml borosilicate glass vials fitted 
with 3.18 mm polypropylene septa (Environmental Sampling Supply). Each series 
was incubated in the dark at 5 or 22 °C under a microaerobic atmosphere maintained 
by a vacuum desiccator purged with five volumes of argon and held at a 
tension slightly below ambient pressure. On the appropriate sampling date, one 
vial from each series was destructively harvested by adding 15 ml of O2-free deionized 
water to the vial and vortexing at maximum speed for 60 s. Pore water and 
solids were separated by centrifugation (15 min at 500g) and subsampled in the glove 
box to minimize potential for oxidation. Pore waters were filtered through a sterile 
0.2 µm syringe filter into 15 ml polyethylene centrifuge tubes and stored frozen at 
−25 °C pending analysis for sulphate and nitrate content. Sulphate concentrations 
were determined using a Dionex DX ion chromatograph equipped with a 4 mm 
AS4A-SC column (Dionex Corporation). Peak areas for sulphate were quantified 
against a six-point calibration (0.5-20.0 µg ml⁻¹) developed from a commercial 
1,000 µg ml⁻¹ stock SO4 solution (Ultra Scientific). Nitrate content was analysed 
colorimetrically using an automated discrete photometric analyser (Aquakem 250, 
Thermo Fisher Scientific) following United States Environmental Protection Agency 
method 353.1. Changes in SO4 and NO3 over time were used as estimates of reduction 
rates. 
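
One simple way to turn a concentration time series into a rate is an ordinary least-squares slope over the sampling days. The sketch below assumes that approach; the text states only that changes over time were used, so the regression itself, the helper name `least_squares_slope` and the sulphate values are illustrative assumptions.

```python
def least_squares_slope(times, values):
    """Ordinary least-squares slope of values against times."""
    n = len(times)
    if n < 2 or n != len(values):
        raise ValueError("need at least two paired observations")
    t_mean = sum(times) / n
    v_mean = sum(values) / n
    sxx = sum((t - t_mean) ** 2 for t in times)
    sxy = sum((t - t_mean) * (v - v_mean) for t, v in zip(times, values))
    return sxy / sxx


# Sampling days from the six-point incubation series described above.
days = [0, 1, 3, 5, 8, 12]
# Hypothetical pore-water sulphate concentrations (µg per ml) on those days.
sulphate = [20.0, 19.5, 18.5, 17.5, 16.0, 14.0]

# A falling concentration gives a negative slope, so the reduction rate
# is reported as the negated slope (µg per ml per day).
reduction_rate = -least_squares_slope(days, sulphate)
print(f"sulphate reduction rate: {reduction_rate:.2f} µg per ml per day")
```

Converting such a value to the per-gram, per-hour units of Table 1 would additionally require the soil mass and extraction volume for each vial.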

Rates of acid-extractable ferric iron (Fe(III)) reduction were estimated at multiple 
time points by measuring ferrous iron (Fe(II)) production colorimetrically 
using the ferrozine method. Briefly, 0.5 g of homogenized whole soil, or peat, was 
transferred under anaerobic conditions into a 15 ml plastic centrifuge tube. We 
added 10 ml of 0.5 M HCl to the tube, and vortexed the mixture at maximum speed 
for 15 s. Tubes were shaken in the dark for 1 h at 200 revolutions per minute, and 
then centrifuged at 2,500g for 15 min. A 100 µl subsample of the supernatant was 
added to 5.0 ml of ferrozine solution (1.0 g l⁻¹ 3-(2-pyridyl)-5,6-diphenyl-1,2,4-triazine-4′,4″-disulphonic 
acid sodium in 50 mM HEPES (4-(2-hydroxyethyl)-1-piperazineethanesulphonic 
acid) adjusted to pH 4) and vortexed for 10 s. Absorbance 
was measured at 562 nm after allowing a full minute for colour development. To 
determine Fe(III), we subsequently added 0.25 ml hydroxylamine-HCl (0.25 M in 
0.25 M HCl) to each tube, and re-read absorbance at 562 nm, after allowing 15 min 
for the reduction of Fe(III) to Fe(II). We used a six-point calibration of Fe(II) standards 
(2-80 µg ml⁻¹) to determine acid-extractable iron fractions. Amorphous 
ferric iron content was calculated as the difference between total acid-extractable 
iron and ferrous iron content before the addition of hydroxylamine. Fe(II) production 
over time was used as an estimate of reduction rate. 
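
The difference calculation described above can be illustrated with a short sketch. It assumes a linear calibration of absorbance against Fe(II) concentration; the intermediate standard concentrations, all absorbance readings and the helper names are hypothetical, and dilution factors are ignored for brevity.

```python
def fit_line(x, y):
    """Least-squares fit of y = intercept + slope * x; returns (slope, intercept)."""
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, y_mean - slope * x_mean


# Six Fe(II) standards spanning the 2-80 µg per ml range (intermediate
# concentrations are assumed) with hypothetical absorbances at 562 nm.
standards = [2.0, 5.0, 10.0, 20.0, 40.0, 80.0]
absorbances = [0.02 * c for c in standards]

slope, intercept = fit_line(standards, absorbances)


def to_concentration(absorbance):
    """Invert the calibration line to get an Fe(II) concentration."""
    return (absorbance - intercept) / slope


# Hypothetical readings for one extract, before and after hydroxylamine.
fe2 = to_concentration(0.30)        # ferrous iron before reduction
total_fe = to_concentration(0.80)   # total acid-extractable iron after reduction
fe3 = total_fe - fe2                # ferric iron by difference

print(f"Fe(II) = {fe2:.1f}, total Fe = {total_fe:.1f}, Fe(III) = {fe3:.1f} µg per ml")
```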

Denitrification rate measurements. We employed a modified acetylene inhibition 
technique to evaluate rates of denitrification among soil and peat replicates. 
In an anaerobic glove box, we placed about 3.0 ml of material (soil or peat) under 
field-moist conditions into two series of six serum vials (13 ml volume), representing 
five sampling periods and a sterilized control, for each core section. Vials were 
crimp-sealed with butyl rubber stoppers, with the exception of sterilized controls, 
which were autoclaved (121 °C and 250 kPa) for 30 min, and allowed to cool in the 
glove box before sealing. Vials were amended with a 15% headspace addition of 
acetylene gas and inverted several times to improve diffusion of the acetylene. Each 
series of vials was incubated statically at 5 or 22 °C before sampling for N2O. Sampling 
times were 10 min and 1, 2, 3 and 5 days after the addition of acetylene. At the 
appropriate sampling time, N2O concentrations were measured by injecting 0.25 ml 
of headspace gas into an HNU model 301 gas chromatograph fitted with a Valco 
⁶³Ni electron-capture detector and a 2.6 m (1.5 mm internal diameter) Porapak R 
100/120 mesh column. Peak detection for N2O was calibrated with certified N2O-in-N2 
standards with concentrations of 0.101, 1.02 and 10.1 ppm by volume (Air 
Liquide America Speciality Gases). Denitrification rates were determined by calculating 
the change in N2O over time. 
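
To relate a headspace reading in ppm by volume to the mass-based units of Table 1, one option is the ideal gas law. The sketch below is an illustration under explicit assumptions that the Methods do not state: ambient pressure of 101.325 kPa, a 10 ml headspace (13 ml vial minus about 3 ml of material), a hypothetical soil mass of 3 g, and no correction for the acetylene addition or for dissolved N2O.

```python
GAS_CONSTANT = 8.314462618   # J mol^-1 K^-1
N2O_MOLAR_MASS = 44.013      # g mol^-1


def n2o_mass_ng(ppmv, headspace_ml, temp_c=22.0, pressure_pa=101_325.0):
    """Mass of N2O (ng) in a headspace, from its mixing ratio in ppm by volume."""
    volume_m3 = headspace_ml * 1e-6
    total_gas_mol = pressure_pa * volume_m3 / (GAS_CONSTANT * (temp_c + 273.15))
    return ppmv * 1e-6 * total_gas_mol * N2O_MOLAR_MASS * 1e9


def n2o_rate_ng_per_h_per_g(ppmv_start, ppmv_end, hours, soil_g, headspace_ml):
    """Change in headspace N2O mass over time, per gram of soil."""
    gained = n2o_mass_ng(ppmv_end, headspace_ml) - n2o_mass_ng(ppmv_start, headspace_ml)
    return gained / hours / soil_g


# Hypothetical readings: N2O rises from 0.1 to 1.1 ppm over 24 h.
example_rate = n2o_rate_ng_per_h_per_g(0.1, 1.1, hours=24.0, soil_g=3.0, headspace_ml=10.0)
print(f"{example_rate:.3f} ng N2O per hour per gram")
```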

Methane oxidation. Soils were collected from the bog and black spruce forest 
APEX sites in Bonanza Creek Long-Term Ecological Research on 18 June 2009. At 
each site, soil cores were taken every 10 m along 40 m transects with five soil cores 
(each core representing a replicate) collected along each transect. Bog and black 
spruce forest cores were collected down to the ice layer using a corer with a dia- 
meter of 4 feet. The living vegetation top layer was removed from each core. Bog 
cores were approximately 30 cm deep (measured from below the vegetative layer 
to the ice layer), and each core was split up into three depth samples: 0-10 cm, 10- 
20 cm and 20-30 cm. Only data from 20-30 cm are reported here. Forest cores 
were approximately 10 cm deep. Cores were kept on ice and transported to the University 
of California, Berkeley, where they were stored at 4 °C until incubation 
experiments were initiated. 

CH4 oxidation incubations. For CH4 oxidation incubations, soils were delivered 
(10.0 ± 0.1 g) to sterile 250 ml jars. The jars were sealed with airtight lids containing 
septa to allow for headspace gas sampling. The initial headspace was composed 
of room air. To ensure that oxygen was not depleted during the CH4 oxidation 
incubations, at the end of the incubations, headspaces were analysed for O2 concentration, 
using a Probe YSI 52 oxygen meter. The final average headspace O2 concentration 
was 19.2 ± 1.0%. For CH4 production incubations, soils (10 g) were 
loaded into 100 ml serum bottles, and bottles were sealed with gas-impermeable 
blue butyl rubber septa (Bellco). Anoxic conditions were created using a standard 
alternating vacuum and pressure approach (for example refs 16, 17). Briefly, bottle 
headspaces were evacuated by vacuuming to 51 kPa and then over-pressured with 
N2 to 40 kPa, and this process was repeated ten times. On final filling, headspace 
was only filled to 13 kPa with N2. The headspaces were then flushed with N2/CO2 
(98/2%) at 135 kPa for 5 min through the septa using 23-gauge needles as input and 
exhaust ports. Headspace pressure was equilibrated to 10 kPa using an N2-flushed 
syringe. For the anaerobic work, all transfers, additions and headspace samplings 
were done using gas-tight syringes fitted with stopcocks that were first degassed 
with N2 to avoid O2 contamination. For all anaerobic work, N2 and N2/CO2 were 
first passed through hot copper filings (~350 °C) to remove traces of O2 (for example 
ref. 18), and the copper filings were reduced daily using H2. All incubations 
were set up with five replicates and with heat-killed controls (autoclaved twice at 
121 °C for 20 min). Methane (1,700 ppm) was delivered to all treatments. Soils 
were incubated for 5 days at 11 °C in the dark. 

CH4 oxidation gas analysis. Headspace samples (2 ml) were taken from incubation 
jars via gas-tight syringe fitted with a stopcock at t = 0 (30 min after CH4 was 
delivered), 24 h and 5 d. Headspace samples were transferred to 10 ml serum bottles 
sealed with thick black butyl rubber stoppers. Samples were analysed using an 
HP6890 GC (Hewlett Packard) fitted with a flame ionization detector and pulsed 
discharge ionization detector for CH4 and CO2 analysis, respectively. CH4 oxidation 
and production rates were calculated by difference in total CH4 concentration 
in the headspace at the beginning and end of the incubation period. 
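
The by-difference calculation can be written in a few lines. In this sketch the sign convention (positive for net oxidation, negative for net production), the function name and the end-point concentrations are assumptions made for illustration; only the 1,700 ppm starting concentration and the 5-day incubation come from the text.

```python
def net_ch4_rate(ppm_start, ppm_end, days):
    """Net CH4 change per day; positive means net oxidation (CH4 consumed)."""
    if days <= 0:
        raise ValueError("incubation length must be positive")
    return (ppm_start - ppm_end) / days


# Hypothetical end-point readings after a 5-day incubation started at 1,700 ppm.
oxidation = net_ch4_rate(1700.0, 1200.0, 5.0)    # headspace CH4 fell
production = net_ch4_rate(1700.0, 1950.0, 5.0)   # headspace CH4 rose

print(oxidation, production)
```

Any correction against the heat-killed controls would be applied on top of this difference; how that was done is not described here.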

Laboratory incubation experiment and the rate of trace gas production. Trace 
gas (CH4, CO2, N2O) production was monitored under aerobic and anaerobic con- 
ditions during a long-term (1 year) laboratory incubation experiment. We added 
1-7 g (dry mass) of soil or peat, with field-moisture content, to 100 ml clear boro- 
silicate glass serum vials (Sigma Aldrich) fitted with either 20 mm grey (aerobic) or 
blue (anaerobic) butyl rubber stoppers (Bellco Glass) and aluminium crimp seals. 
All vials were sealed and pre-incubated for 3 days in the dark before gas measurements 
were initiated. To prevent inhibition of respiration due to excessive CO2 
build-up, aerobic vials were capped for only 24 h before gas flux measurements, then 
aerated before being returned to their respective incubation chambers. Between 
sampling periods, these vials were covered with 0.8 mm polyethylene sheeting to 
prevent excessive moisture loss while still allowing gas exchange. Anaerobic vials 
were flushed with argon for 10 min after measurement to reset headspace atmosphere 
for the next sampling period. After gas sampling, the water content of all 
vials was checked gravimetrically and adjusted, as necessary, to the original field-moisture 
content with O2-free deionized water. Incubations were conducted at 5 °C 
and room temperature (22 °C). We sampled the headspace of all vials weekly for 
4 weeks. We removed 5 ml from the headspace of each vial using an air-tight syringe, 
fitted with a Luer-lock stopcock sealed with high-performance vacuum grease, 
and measured CO2, CH4 and N2O with a gas chromatograph pre-configured for 
greenhouse gas monitoring (SRI 8610C gas chromatograph with flame ionization 
detector-methanizer and electron capture detector, column 3.2 mm internal diameter 
× 6 m HaySep D, 1 ml sample loop; SRI Instruments). Peak areas for trace 
gases were calibrated using a three-point curve (100, 1,000 and 10,000 ppm) for 
CO2, a three-point curve (10, 100 and 1,000 ppm) for CH4 and a two-point curve 
(0.101 and 1.02 ppm) for N2O. Calibration curves were developed with certified 
gas standards balanced with N2 (Air Liquide America Speciality Gases). 
Statistical analysis for soil edaphic characteristics and process rates. One-way 
analysis of variance was used to compare moisture content, pH, electrical conduc- 
tivity, dissolved organic carbon and total dissolved nitrogen among active layer, 
permafrost and bog soils (n = 4-5). In the cases where data were not normally dis- 
tributed or could not be transformed to a normal distribution, non-parametric 
Wilcoxon tests were used. A P value of 0.05 was used as the significance level. For 
Fe(II) production, SO4 loss, NO3 loss, denitrification and CH4 flux at 4 weeks under 
anaerobic conditions, aerobic CO2 production at 4 weeks and anaerobic CO2 production 
at 4 weeks, two-way analysis of variance was used with soil type and incubation 
temperature as the independent variables. In all cases, except for NO3 loss, 
rates were higher with increased temperature. In Table 1, we present the main 
effect of ‘soil type’ and its mean and standard error. For each soil × temperature 
combination, n = 5. 
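
As a concrete illustration of the one-way comparison among the three soils, the sketch below computes the F statistic by hand. The pH-like values are hypothetical (three replicates per soil rather than the n = 4-5 used in the study), the function name is illustrative, and no P value is derived: the statistic would still need to be compared against an F distribution with the returned degrees of freedom.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of sample groups."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical pH readings for the three soils.
permafrost = [5.6, 5.7, 5.8]
active_layer = [4.6, 4.7, 4.8]
bog = [4.8, 4.9, 5.0]

f_stat, df_b, df_w = one_way_anova_f([permafrost, active_layer, bog])
print(f"F({df_b}, {df_w}) = {f_stat:.1f}")
```

The two-way design with soil type and incubation temperature, and the non-parametric fallback, are not covered by this sketch.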

Sample preparation for nucleic-acid extraction. To ensure that the samples were 
free of surface contamination before further processing for microbial analyses, all 
surfaces of the frozen sections were removed before nucleic-acid extraction. The 
permafrost soil cubes were sprayed with 1.82 × 10” fluorescent microbeads (Fluoresbrite 
Yellow Green latex microbeads, 0.5 µm, Polyscience) to detect removal of 
the possibly contaminated surface soil of the cubes. The beads were washed three 
times with 70% RNase-free ethanol and twice with diethylpyrocarbonate (DEPC)- 
treated Milli-Q water before use. All tools were sterilized by autoclaving twice followed 
by baking at 120 °C overnight. The cubes were sawn with an electronic 
jigsaw on a dry ice bed using sterile blades. Sterile, chilled clamps were used to hold 
the cube as it was sawn. After cutting a side of the cube, the clamps were soaked in 
10% bleach followed by sterile Milli-Q water and finally wiped with 70% ethanol 
before using them for sawing the new surface. During the sawing, the dry ice was 
covered with sterile and RNase-free foil that was changed after sawing each of the 
cube’s sides. After sawing, samples were taken from the newly exposed surface 
with a sterile scalpel and examined by epifluorescence microscopy for the presence 
of the fluorescent beads. Acridine orange direct staining and epifluorescence microscopy 
were conducted on all samples; cell density in permafrost was found to be 
approximately 1 10%, and in bog and active layer was twice that amount: approx- 
imately 2 X 10° cells per gram of soil (data not shown). 

Subsamples were taken from the cleaned and frozen soil cubes, homogenized while 
frozen and nucleic acids were extracted from three replicate 500 mg portions of soil from 
each cube. Nucleic acids were extracted using the hexadecyltrimethyl ammonium 
bromide (CTAB) and phenol-chloroform and bead beating protocol (see ref. 3) 
with a modification of adding a pressure lysis step before bead beating with a 
Barocycler (Pressure Biosciences) set to alter pressure from ambient to 2,413 bar 
for 20 s and back to ambient for 10 s, 20 times. After extraction, nucleic acids were 
purified with the AllPrep kit (Qiagen) according to the manufacturer’s instructions, 
with DNase treatment for the RNA samples. The purified replicates of the DNA and 
RNA samples were pooled and the resulting yield and quality were measured with 
a Bioanalyzer (Agilent). 

The V6-V9 area of the 16S rRNA gene was amplified at the Joint Genome Insti- 
tute (JGI) with primers 926wF (aaa ctY aaa Kga att gRc gg) and 1406R creating an 
amplicon of approximately 480 base pairs (bp). The reverse primer included a 5-bp 
barcode for multiplexing of samples during sequencing. The sequence reads were 
de-noised using Pyrotagger (ref. 19). Sequence reads were assigned into OTUs at 97% 
similarity. The representative from each OTU (97%) was classified by comparing 
with the Greengenes database (version 29nov2010). Potential chimaeras were identified 
using Pyrotagger; 69 OTUs were detected as possibly chimaeric and removed 
from the data set. 

As the number of sequence reads recovered from each sample varied, the sample 
data sets were rarefied before alpha diversity analysis to the read count of 
the smallest sample. The microbial diversity did not vary drastically between the 
different samples, showing the frozen and thawed sites to carry a diverse microbial 
community by the OTU number and Chao1 and ACE diversity estimates (Extended 
Data Table 2). 

Alpha diversity indices were calculated with the R package vegan 
(http://cran.r-project.org/web/packages/vegan/index.html) (ref. 20) and in QIIME (ref. 21) 
using a rarefied data set, as the number of sequence reads per sample varied. 
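
As an illustration of the rarefaction and richness estimation described above, the following minimal Python sketch subsamples each OTU count vector to a common depth and computes a bias-corrected Chao1 estimate. It is not the vegan or QIIME code used in the study; the function names `rarefy` and `chao1` and the example counts are hypothetical, and the estimator variant implemented in those packages may differ in detail.

```python
import random
from collections import Counter


def rarefy(counts, depth, seed=0):
    """Subsample `depth` reads without replacement from an OTU count vector."""
    # Expand the count vector into one entry per read, labelled by OTU index.
    pool = [otu for otu, n in enumerate(counts) for _ in range(n)]
    if depth > len(pool):
        raise ValueError("depth exceeds the number of reads in the sample")
    tally = Counter(random.Random(seed).sample(pool, depth))
    return [tally.get(otu, 0) for otu in range(len(counts))]


def chao1(counts):
    """Bias-corrected Chao1: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1))."""
    s_obs = sum(1 for n in counts if n > 0)   # observed OTUs
    f1 = sum(1 for n in counts if n == 1)     # singletons
    f2 = sum(1 for n in counts if n == 2)     # doubletons
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


if __name__ == "__main__":
    # Hypothetical OTU tables; the real data are in Extended Data Table 2.
    samples = {"P1": [40, 12, 3, 1, 1, 0], "B3": [25, 20, 9, 2, 1, 1]}
    depth = min(sum(c) for c in samples.values())  # smallest sample sets the depth
    for name, counts in samples.items():
        print(name, chao1(rarefy(counts, depth)))
```

Rarefying to the smallest sample discards reads from deeper samples, which is the price paid for comparing richness estimates at equal sequencing effort.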

The OTU matrix with the relative abundance of each OTU was used to build a 
rank abundance curve of the 20 most abundant OTUs in all samples (Extended 
Data Fig. 5). The 454 reads have been deposited in the Sequence Read Archive under 
BioProject number PRJNA222786. 

Metagenomes. Two samples from each of the three sample types were metagenome 
sequenced: active layer samples A2 and A5, permafrost samples P1 and P3 and 
thermokarst bog samples B3 and B4. The quantity of the extracted DNA from the 
permafrost samples was not adequate for Illumina sample preparation and thus the 
DNA was emulsion PCR amplified as previously described’ after shearing it into 
fragments of about 300-500 bp. Briefly, 10 ng of DNA was sheared with a Covaris 
shearing apparatus and Illumina sequencing adapters were ligated to the end- 
repaired fragments. The fragments were amplified with the adaptor primers in an 
emulsion so that each fragment was amplified in an individual oil droplet. After 
amplification, the correct size fragments were extracted from a low-melting elec- 
trophoresis gel; the purified fragments were sequenced using the Illumina GAII 
sequencing platform (Illumina) for two cycles, generating 2 × 113 bp paired-end 
reads. 

Assembly. The metagenomic reads were assembled using the de novo assembler at 
CLC Genomics Workbench (CLC bio). The default parameters were used except the 
minimum length of contigs, which was 1,000. CLC assembler resulted in assembly 
of 18.8-59.8% of reads with an N50 from 890 to 1,117 bp. 
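
For readers unfamiliar with the N50 statistic quoted above, the short sketch below shows how it is conventionally computed from a list of contig lengths. The function name `n50` and the example lengths are illustrative only; this is not output from, or code belonging to, the CLC assembler.

```python
def n50(contig_lengths):
    """Return the length L such that contigs of length >= L together
    contain at least half of all assembled bases."""
    lengths = sorted(contig_lengths, reverse=True)
    half = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half:
            return length
    raise ValueError("no contigs given")


if __name__ == "__main__":
    # Hypothetical contig lengths in base pairs.
    print(n50([1000, 900, 800, 700]))
```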

All of the contigs and the individual reads were submitted to the Integrated 
Microbial Genomes and Metagenomes (IMG/M, US Department of Energy; ref. 22) for gene 
calling and annotation. Additionally, the reads were annotated in MG-RAST (ref. 23) 
against the SEED subsystems database with an e value of 10 >. 

Binning draft genomes from MG. Reads were mapped to contigs using BWA. 
Contigs greater than 2.5 kilobases in length were grouped into genome bins on the 
basis of abundance and tetranucleotide frequency using MetaBat (https://bitbucket. 
org/berkeleylab/metabat). To account for the fact that our data represent complex 
communities and to minimize contamination, we used the very specific option, 
which specifies a probability cutoff of 90% for imminent neighbours, 90% for sec- 
ondary neighbours and 90% for tertiary neighbours. All other parameters were set 
to default. Completeness and contamination of genome bins were assessed using 
CheckM (ref. 24), which identified lineage-specific single-copy marker genes in each bin. 
Genome completeness was estimated using the number of marker genes present in 
the genome and contamination was evaluated from the number of multi-copy marker 
genes. Bins that were less than 20% complete or with greater than 10% contamination 
were discarded. AMPHORA2 was used to evaluate taxonomy. A confidence 
score for each gene was generated at each rank of classification (ref. 25). Marker lineage 
was reported if 75% of the classifications were in agreement at a particular taxonomic 
level. Genome-to-Genome Distance Calculator"! was used for selected bins 
to calculate the DDH similarity. The annotated bins are stored at IMG/M under 
Study ID Gs0063124. 
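
The bin-quality thresholds and the 75% agreement rule described above can be expressed as a small filter. The sketch below is an illustration under stated assumptions: the `BinQuality` record, the function names and the example values are hypothetical and do not reproduce the CheckM or AMPHORA2 output formats.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class BinQuality:
    name: str
    completeness: float   # percent, as estimated from single-copy marker genes
    contamination: float  # percent, as estimated from multi-copy marker genes


def keep_bin(b, min_completeness=20.0, max_contamination=10.0):
    """Discard bins that are <20% complete or >10% contaminated."""
    return b.completeness >= min_completeness and b.contamination <= max_contamination


def consensus_taxon(assignments, min_fraction=0.75):
    """Report a taxon at one rank only if >= 75% of marker genes agree."""
    if not assignments:
        return None
    taxon, n = Counter(assignments).most_common(1)[0]
    return taxon if n / len(assignments) >= min_fraction else None


if __name__ == "__main__":
    bins = [BinQuality("bin1", 85.0, 2.5), BinQuality("bin2", 15.0, 1.0),
            BinQuality("bin3", 60.0, 12.0)]
    print([b.name for b in bins if keep_bin(b)])
    print(consensus_taxon(["Chloroflexi"] * 3 + ["Firmicutes"]))
```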

Metatranscriptomics. Approximately 14 ng of RNA was used in first-strand 
synthesis of complementary DNA (cDNA) with 100 nM of primer N6-T7 (5′ NNNNNN-T7) 
and Superscript III First-Strand kit (Invitrogen) at 37 °C without the last 
denaturation step”® and with 200 1M of random hexamers. An Invitrogen Second- 
Strand synthesis kit was used for second-strand synthesis at 16°C followed by 
purification with Qiagen PCR purification columns. The purified cDNA was 
amplified with the Ambion aRNA amplification kit (Ambion) according to the 
manufacturer’s protocol. The resulting cDNA was purified on Qiagen PCR columns 
and eluted in RNase-free water. The cDNA was quantified and qualified with 
Bioanalyzer Pico RNA chips (Bioanalyzer). 

For the permafrost samples, 10 ng of linear amplified cDNA was used for Illumina 
library preparation through emulsion PCR as with metagenomes. The active layer 
and thermokarst samples were used directly in the Illumina library preparation at 
the JGI sequencing facility using the manufacturer’s protocols. One Illumina HiSeq 
lane was sequenced per sample with 150 bp for two cycles generating 2 × 150 bp 
paired-end reads. Individual reads were annotated at IMG/M and MG-RAST 
according to the parameters used for metagenomic reads. Metagenomic and 
metatranscriptomic read annotations can be viewed at IMG/M under the following 
submission numbers: sample P1-MG, 10301 and 10302; P3-MG, 10298 and 10299; 
B3-MG, 10560; B4-MG, 10561 and 10562; A2-MG, 10308 and 10559; A5-MG, 
10305 and 10306; P3-MT, 10300; B3-MT, 10304; B4-MT, 10303; A2-MT, 10309; 
A5-MT, 10307. 

Metagenomic and metatranscriptomic annotations can be viewed at IMG/M 
under Study ID Gs0063124. 

Metaproteomics. Proteins were extracted using the SDS lysis-based method with 
trichloroacetic acid (TCA) precipitation and a clean-up step followed by a 10 kDa spin 
filter, modified from ref. 9. Briefly, 5 g of frozen sample were suspended in 5 ml of 
SDS lysis buffer (4% SDS, 100 mM Tris-HCl, pH 8.0) and boiled for 5 min; this 
procedure presumably shuts down all cellular processes and captures the 
metaproteome in situ. Cells were further disrupted by sonication pulses of 10 s on, 
10 s off, for 2 min. After a second 5 min boiling step the samples were centrifuged and 
dithiothreitol (DTT) was added to the supernatant to a final concentration of 24 mM. 
The samples were precipitated with 20% TCA overnight at −20 °C. Precipitated 
samples were centrifuged at 10,000g for 40 min at 4 °C and the pellets were 
washed with ice-cold acetone. After mixing by vortexing, the samples were 
centrifuged for 5 min at 4 °C at 21,000g. Protein pellets were solubilized in 8 M urea 
with 100 mM Tris-HCl, pH 8.0, by incubation at room temperature (27 °C) for 
30 min and sonicated in an ice-water bath to keep the samples below 37 °C. 
Samples were adjusted to 20 mM iodoacetamide and incubated in the dark for 15 min at 
room temperature. Trypsin digestion and peptide clean-up were performed as in 
ref. 27; the resultant peptide solutions were split into four replicates and frozen at 
−80 °C until analysis. 

Measurement of peptides by two-dimensional liquid chromatography-tandem 
mass spectrometry. Complex peptide samples from each soil type were pressure 
cell loaded onto a biphasic two-dimensional back column packed with 5 cm strong 
cation exchange resin for charge-based separation of peptides followed by 3 cm C18 
reversed phase for online desalting (Luna and Aqua respectively, Phenomenex). 
Once loaded, the sample columns were washed offline with solvent A (5% acet- 
onitrile, 95% high-performance liquid chromatography (HPLC)-grade water, 0.1% 
formic acid) for 15 min to remove residual urea and NaCl followed by a gradient to 
100% solvent B (70% acetonitrile, 30% HPLC-grade water, 0.1% formic acid) over 
30 min to move the peptide population from the reversed phase to the strong cation 
exchange resin. Washed samples were then placed in-line with a nanospray emitter 
(100 µm internal diameter, New Objective) packed with 15 cm of C18 reversed 
phase material (Aqua) and analysed in duplicate by 24 h two-dimensional liquid 
chromatography-tandem mass spectrometry (MS/MS) (11 salt-pulses: 5, 7.5, 10, 
12.5, 15, 17.5, 20, 25, 35, 50, 100% of 500 mM ammonium acetate followed by a 
100 min gradient to 50% solvent B) with a hybrid Velos/Orbitrap mass spectro- 
meter (Thermo Fisher Scientific) operating in data-dependent mode. Full MS1 
scans were obtained using an Orbitrap mass analyser set to 330,000 resolution, while 
20 data-dependent MS/MS scans were acquired per full scan (two microscans). 
Data-dependent MS/MS scans were acquired in the Velos linear ion trap, with two 
microscans averaged and dynamic exclusion enabled. Two replicate measurements 
were obtained for each sample, except sample B4 which had one measurement. 
Database searching and protein identification. FASTA databases were compiled 
with the protein sequences of the assembled and annotated metagenomes (active 
layer, permafrost and thermokarst bog), 180 proteomes of known soil bacteria (Sup- 
plementary Table 2) and known protein contaminants (trypsin, keratin, etc.) into 
one single composite database. Assembled metagenomes used for proteome searches 
were submitted to IMG under the following submission numbers: permafrost, 2321; 
active layer, 2382; thermokarst bog, 2383. 

All MS/MS spectra were searched as in ref. 27 with the SEQUEST algorithm 
against the composite FASTA database and filtered with DTASelect/Contrast at 
the peptide level (Xcorrs of at least 1.8 (+1), 2.5 (+2), 3.5 (+3)). Only proteins 
identified with two fully tryptic peptides, with one unique to that specific protein 
entry, from a 22 h run were considered for further biological study. 
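
The peptide- and protein-level acceptance criteria above can be sketched as a simple filter. This is not DTASelect itself: the tuple layout, the function names and the treatment of charge states above +3 (rejected here, because the text gives no threshold for them) are assumptions, and the input peptides are assumed to be fully tryptic already.

```python
# Minimum Xcorr per precursor charge state, as stated in the text.
XCORR_MIN = {1: 1.8, 2: 2.5, 3: 3.5}


def passes_xcorr(charge, xcorr):
    """Peptide-level filter; charges without a stated threshold are rejected."""
    threshold = XCORR_MIN.get(charge)
    return threshold is not None and xcorr >= threshold


def accepted_proteins(psms):
    """Keep proteins with >= 2 distinct passing peptides, >= 1 of them unique.

    `psms` is an iterable of (peptide, charge, xcorr, proteins) tuples, where
    `proteins` lists every database entry the peptide maps to.
    """
    peptides_by_protein = {}
    has_unique = set()
    for peptide, charge, xcorr, proteins in psms:
        if not passes_xcorr(charge, xcorr):
            continue
        for protein in proteins:
            peptides_by_protein.setdefault(protein, set()).add(peptide)
        if len(proteins) == 1:  # peptide maps to exactly one protein entry
            has_unique.add(proteins[0])
    return sorted(p for p, peps in peptides_by_protein.items()
                  if len(peps) >= 2 and p in has_unique)


if __name__ == "__main__":
    # Hypothetical peptide-spectrum matches.
    example = [("PEPTIDEK", 2, 3.0, ("P1",)), ("ANOTHERR", 2, 2.6, ("P1", "P2")),
               ("LOWSCOREK", 2, 2.0, ("P2",)), ("SHAREDK", 1, 1.9, ("P2", "P3"))]
    print(accepted_proteins(example))
```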
Metaproteomics data analysis. To prepare for semiquantitative proteome 
analysis, DTASelect-filtered data were subjected to spectral count balancing and 
subsequent normalized spectral abundance factor (NSAF) determination (ref. 28). For the former, 
the unique status of each identified peptide was assessed. If a peptide was deemed 
unique, it retained 100% of its previously assessed spectral count. However, if a 
peptide was found to belong to more than one representative protein, namely non- 
unique, its spectral count was re-calculated on the basis of the ratio of uniquely 
identified peptides between the two or more proteins that shared the non-unique 
peptide in question. The adjusted spectral count of the proteins incorporated these 
balanced values, which corrected for the slight quantitative bias that occurs with 
homologous proteins and/or proteins with homologous regions, both of which 
artificially inflate spectral count values at the expense of proteins that share no 
homology. Once spectrally balanced, NSAF values were calculated for each pro- 
tein in a specific run to normalize individual MS runs on the basis of the total 
number of spectra collected and protein length, which by itself introduces a bias 
that favours larger proteins if not corrected for. Summed technical replicates and 
NSAF values were used for subsequent analysis. A Venn diagram was drawn for 
the shared and unique proteins present in three out of four technical replicates 
in each soil. The mass spectrometry proteomics data have been deposited in the 
ProteomeXchange Consortium via the PRIDE partner repository with the data set 
identifier PXD001131. 
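
The spectral count balancing and NSAF normalization described above can be illustrated with a short sketch. It follows the standard NSAF definition (each protein's spectral count divided by its length, then divided by the sum of these ratios over all proteins in the run). The function names, the example values and the equal split used when no sharing protein has unique evidence are assumptions and are not taken from the authors' scripts.

```python
def balance_shared(unique_counts, shared_count, proteins):
    """Split a shared peptide's spectral count among `proteins` in
    proportion to each protein's uniquely identified spectral count."""
    total_unique = sum(unique_counts[p] for p in proteins)
    if total_unique == 0:
        # Assumption: with no unique evidence, split the count equally.
        return {p: shared_count / len(proteins) for p in proteins}
    return {p: shared_count * unique_counts[p] / total_unique for p in proteins}


def nsaf(spectral_counts, lengths):
    """NSAF_i = (SpC_i / L_i) / sum_j (SpC_j / L_j) within one MS run."""
    saf = {p: spectral_counts[p] / lengths[p] for p in spectral_counts}
    total = sum(saf.values())
    return {p: value / total for p, value in saf.items()}


if __name__ == "__main__":
    # Hypothetical proteins: A and B share a peptide observed in 8 spectra.
    unique = {"A": 3, "B": 1}
    shared = balance_shared(unique, 8, ["A", "B"])
    counts = {p: unique[p] + shared[p] for p in unique}
    print(nsaf(counts, {"A": 300, "B": 150}))
```

Dividing by protein length before normalizing is what removes the bias towards larger proteins mentioned in the text.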

Proteome search to additional genomes. Microbial proteins from 13 recently 
sequenced species isolated from cold environments (Supplementary Table 2) were 
collected from NCBI RefSeq and JGI in FASTA format. With these protein sequences, 
we searched MS/MS peaks of our data using MyriMatch 2.1.138. We set MyriMatch 
options as MonoPrecursorMzTolerance = 3 m/z, MonoisotopeAdjustmentSet = 0, 
FragmentMzTolerance = 0.8 m/z, CleavageRules = Trypsin/P, MinTerminiCleav- 
ages = 1, ComputeXCorr = true and MaxResultRank = 10. We also added static 
carbamidomethyl modification on Cys by adding 57.021462 Da. We ran MyriMatch 
with 48 central processing units at the Newton cluster of the University of Tennessee, 
Knoxville (https://newton.utk.edu/). PepXML files, produced by MyriMatch, were 
analysed by in-house Python scripts to get proteins from peptide-to-spectrum 
matches as was done by DTASelect in the original search. Spectrum counts of every 
protein were balanced and normalized by NSAF as was done in the original search. 
BLASTP was used to compare protein sequences between the original database and 
the 13 new species. We used default options of BLASTP and accepted BLAST hits 
whose matched length was greater than 95% of the length of the query protein and 
that had no gap openings. 
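
The hit-acceptance rule above (alignment longer than 95% of the query and no gap openings) can be sketched as a filter over BLAST tabular output. The sketch assumes the default 12-column tabular layout (qseqid, sseqid, pident, length, mismatch, gapopen, qstart, qend, sstart, send, evalue, bitscore) and a separately supplied table of query lengths; the authors' in-house scripts are not available, so the function name and inputs are hypothetical.

```python
import csv
import io


def filter_hits(tabular_text, query_lengths, min_fraction=0.95):
    """Return (query, subject) pairs whose alignment covers more than
    `min_fraction` of the query length and contains no gap openings."""
    kept = []
    for row in csv.reader(io.StringIO(tabular_text), delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        qseqid, sseqid = row[0], row[1]
        align_len, gapopen = int(row[3]), int(row[5])
        if gapopen == 0 and align_len > min_fraction * query_lengths[qseqid]:
            kept.append((qseqid, sseqid))
    return kept


if __name__ == "__main__":
    # Hypothetical hits for two 100-residue query proteins.
    hits = ("q1\ts1\t99.0\t98\t1\t0\t1\t98\t1\t98\t1e-50\t200\n"
            "q1\ts2\t90.0\t98\t5\t1\t1\t98\t1\t99\t1e-40\t150\n"
            "q2\ts3\t100.0\t50\t0\t0\t1\t50\t1\t50\t1e-20\t100\n")
    print(filter_hits(hits, {"q1": 100, "q2": 100}))
```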

Data analysis. In addition to IMG/M annotation, a BLAST search was conducted 
on the metagenomic and metatranscriptomic reads against the Greengenes database 
(version 29nov2010); one best hit was chosen for each read, with an e value 
cutoff of 10~° to focus on the 16S rRNA gene profiles. Since the Illumina HiSeq 
runs produced over 20 Gb of data, the reads were pre-analysed with the Ribopicker 
program to collect only the presumably 16S rRNA origin reads, which were further 
classified with BLAST. The relative abundance of each phylum in MG and MT data 
sets was compared in STAMP (ref. 29) with Fisher’s exact test and Storey’s false discovery 
rate correction. MT/MG data were compared with a two-tailed t-test. 

Triplots based on the relative-abundance-normalized read annotations were 
calculated in the R package vcd (http://cran.r-project.org/web/packages/vcd/index.html) (ref. 30) 
and barcharts were drawn for selected cycles using the normalized data. The 
heatmap focusing on the transporters was obtained from the top 40 (relative 
abundance) MT reads that were assigned as transporters. A Venn diagram (Extended Data 
Fig. 2) was constructed from the selected genes present in the three sample types 
(permafrost, active layer and bog) for all three data sets (MG, MT and MP). 

The three different omics approaches were compared (Extended Data Fig. 6) for 
each site by extracting the Kyoto Encyclopedia of Genes and Genomes (KEGG) hits 
for MG, MT and MP data sets and comparing these in ternary plots as described 
above; the MT and MG data sets were multiplied by 100 to make the values 
comparable. 


©2015 Macmillan Publishers Limited. All rights reserved 




14. Lovley, D. R. & Phillips, E. J. Rapid assay for microbially reducible ferric iron in aquatic sediments. Appl. Environ. Microbiol. 53, 1536-1540 (1987). 
15. Balderston, W. L., Sherr, B. & Payne, W. J. Blockage by acetylene of nitrous oxide reduction in Pseudomonas perfectomarinus. Appl. Environ. Microbiol. 31, 504-508 (1976). 
16. Balch, W. E., Fox, G. E., Magrum, L. J., Woese, C. R. & Wolfe, R. S. Methanogens: reevaluation of a unique biological group. Microbiol. Rev. 43, 260-296 (1979). 
17. Zehnder, A. J. & Brock, T. D. Anaerobic methane oxidation: occurrence and ecology. Appl. Environ. Microbiol. 39, 194-204 (1980). 
18. Macy, J. M., Snellen, J. E. & Hungate, R. E. Use of syringe methods for anaerobiosis. Am. J. Clin. Nutr. 25, 1318-1323 (1972). 
19. Kunin, V. & Hugenholtz, P. PyroTagger: a fast, accurate pipeline for analysis of rRNA amplicon pyrosequencing data. Open J. 1, 1-8 (2010). 
20. Oksanen, J., Kindt, R., Legendre, P. & O’Hara, R. B. Vegan: Community Ecology Package v.1.8-2 (2006). 
21. Caporaso, J. G. et al. QIIME allows analysis of high-throughput community sequencing data. Nature Methods 7, 335-336 (2010). 
22. Markowitz, V. M. et al. IMG: the Integrated Microbial Genomes database and comparative analysis system. Nucleic Acids Res. 40, D115-D122 (2012). 
23. Meyer, F. et al. The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinformatics 9, 386 (2008). 
24. Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. PeerJ PrePrints 2, e554v1 (2014). 
25. Wu, M. & Scott, A. J. Phylogenomic analysis of bacterial and archaeal sequences with AMPHORA2. Bioinformatics 28, 1033-1034 (2012). 
26. Leininger, S. et al. Archaea predominate among ammonia-oxidizing prokaryotes in soils. Nature 442, 806-809 (2006). 
27. Giannone, R. J. et al. Proteomic characterization of cellular and molecular processes that enable the Nanoarchaeum equitans-Ignicoccus hospitalis relationship. PLoS ONE 6, e22942 (2011). 
28. Florens, L. & Washburn, M. P. Proteomic analysis by multidimensional protein identification technology. Methods Mol. Biol. 328, 159-175 (2006). 
29. Parks, D. H. & Beiko, R. G. Identifying biologically relevant differences between metagenomic communities. Bioinformatics 26, 715-721 (2010). 
30. Meyer, D., Zeileis, A. & Hornik, K. vcd: Visualizing Categorical Data v.1.2-13 (2012). 





Extended Data Figure 1 | Comparison of multi-omics data between the three zones. The triplots visually present selected functions in 
(a) metagenomes, (b) metatranscriptomes and (c) metaproteomes in the three 
studied zones (permafrost, active layer and thermokarst bog). The colours 
correspond to SEED subsystems categories in MG-RAST; in addition, the 
closer the symbol is to the node of the triplot, the more abundant the gene, 

Extended Data Figure 2 | Shared and unique genes, transcripts and proteins in the three zones. The Venn diagrams for (a) metagenome, 
(b) metatranscriptome and (c) metaproteome data illustrate the shared and unique genes, transcripts and proteins in the three zones. 

Extended Data Figure 3 | Heatmap showing the 40 most abundant transporters found in the metatranscriptomes. 

Extended Data Figure 4 | Abundance of genes involved in sulphate reduction, nitrogen cycle, methanogenesis and methane oxidation. The relative 
abundance of each gene is shown; error bars, s.d. The genes with protein matches in the metaproteome data are marked with a triangle. 

permafrost and a draft genome bin was obtained that corresponded to 
this OTU (Extended Data Table 1). The abundant archaeal OTUs 
were methanogens. 

Extended Data Table 1 | Metagenomic bins produced from the metagenomes 

Extended Data Table 2 | Number of OTUs from 454 sequencing and alpha diversity indices 

Pathogen-secreted proteases activate a novel plant immune pathway 


Zhenyu Cheng, Jian-Feng Li, Yajie Niu, Xue-Cheng Zhang, Owen Z. Woody, Yan Xiong, Slavica Djonović, 
Yves Millet, Jenifer Bush, Brendan J. McConkey, Jen Sheen & Frederick M. Ausubel 


Mitogen-activated protein kinase (MAPK) cascades play central roles 
in innate immune signalling networks in plants and animals’. In 
plants, however, the molecular mechanisms of how signal perception 
is transduced to MAPK activation remain elusive’. Here we report 
that pathogen-secreted proteases activate a previously unknown signalling 
pathway in Arabidopsis thaliana involving the Gα, Gβ, and Gγ subunits of 
heterotrimeric G-protein complexes, which function upstream of an MAPK cascade. 
In this pathway, receptor for activated C kinase 1 (RACK1) functions as a novel 
scaffold that binds to the Gβ subunit as well as to all three tiers of the MAPK 
cascade, thereby linking upstream G-protein signalling to downstream activation 
of an MAPK cascade. The protease-G-protein-RACK1-MAPK cascade modules 
identified in these studies are distinct from previously described plant immune 
signalling pathways such as that elicited by bacterial flagellin, in which 
G proteins function downstream of or in parallel to an MAPK cascade without 
the involvement of the RACK1 scaffolding protein. The discovery of the new 
protease-mediated immune signalling pathway described here was facilitated 
by the use of the broad host range, opportunistic bacterial pathogen 
Pseudomonas aeruginosa. The ability of P. aeruginosa to infect both 
plants and animals makes it an excellent model to identify novel 
immunoregulatory strategies that account for its niche adaptation to 
diverse host tissues and immune systems. 

We found that culture filtrate of P. aeruginosa strain PA14 activates 
an Arabidopsis β-glucuronidase (GUS) reporter gene under the control 
of the pathogen-inducible CYP71A12 promoter (CYP71A12pro:GUS). 
Whereas the well-characterized immune elicitor flg22, a synthetic pep- 
tide that corresponds to the active epitope of bacterial flagellin, induces 
CYP71A12pro:GUS in the root elongation zone’, PA14 culture filtrate 
activates the reporter in the cotyledons and leaves of both wild-type 
Arabidopsis Col-0 and fls2 mutant seedlings in which the flagellin 
receptor is mutated (Fig. 1a). 

By screening a collection of 64 P. aeruginosa PA14 regulatory and 
secretion-related mutants, we found that the induction of the CYP71A12 
promoter was dependent on the quorum-sensing gene lasI and on the 
type II secretion apparatus-encoding genes xcpR, xcpT, xcpW, and 
xcpZ (Fig. 1a and Extended Data Table 1). Ion-exchange chromatography 
fractionation (Extended Data Fig. 1a) followed by mass spectrometry 
(data not shown) identified the elicitor in the PA14 secretome as 
protease IV, a type II-secreted, PvdS-regulated lysyl class serine protease 
encoded by the P. aeruginosa prpL gene (PA14_09900). Purified His-tagged 
PA14 protease IV (referred to as PrpL in the figure legends) 
activated CYP71A12pro:GUS (Extended Data Fig. 1b), whereas culture 
filtrate from an in-frame deletion mutant of prpL (PA14/ΔprpL) did 
not (Fig. 1a). 

Figure 1 | Proteases trigger innate immune responses in Arabidopsis via 
proteolytic activity. a, Activation of CYP71A12pro:GUS in wild-type 
Arabidopsis Col-0 or fls2 mutant cotyledons by culture filtrates from wild-type 
P. aeruginosa PA14, from PA14 mutants containing a transposon insertion in 
lasI or xcpR, or from PA14/ΔprpL. b, Western blot depicting activation of 
MAPKs by PrpL or flg22. Numbers on the left axis of the blot represent marker 
size (molecular mass in kilodaltons). c, Chemiluminescence assay showing 
elicitation of an oxidative burst by PrpL; r.l.u., relative luminescence units. HK: 
100 nM ‘heat-killed’ PrpL. d, Callose formation in cotyledons elicited by 
PrpL or flg22 detected by aniline blue staining. e, Protection of 4-week-old 
Arabidopsis leaves from P. syringae pv. tomato strain DC3000 infection by 
pre-infiltrated PrpL; c.f.u., colony-forming units. f, Western blot depicting 
activation of MPK3 and MPK6 by PrpL and inactive variants of PrpL. The same 
molecular mass region of the blot is shown as in b. g, Western blot depicting 
activation of MPK3 and MPK6 by PrpL or ArgC or TLCK-treated PrpL or 
TLCK-treated ArgC. The same molecular mass region of the blot is shown as in 
b. h, Growth of X. campestris strains 8004/argC or 8004/vector in 3-week-old 
B. oleracea leaves. Data represent mean ± s.d.; n = 16 individual seedlings 
(c) and n = 10 leaves from five plants (e, h); ***P < 0.001, Student’s t-test. 
The experiments in a and d were repeated three times with similar results and 
the representative images shown were selected from at least three images. 

Purified protease IV is a very strong elicitor of immune responses in 
Arabidopsis, comparable to flg22 in the activation of MPK3 and MPK6 
(but not MPK4) (Fig. 1b), elicitation of an oxidative burst (Fig. 1c), 
deposition of callose in cotyledons (Fig. 1d), and protection of adult 
Arabidopsis leaves from Pseudomonas syringae pathovar (pv.) tomato 
strain DC3000 infection (Fig. 1e). In contrast, trypsin, a well-characterized 
serine protease, failed to activate MAPK cascades or trigger an oxida- 
tive burst (Extended Data Fig. 2a, b). Global transcriptional profiling 
analysis (Extended Data Fig. 3a), confirmed by quantitative PCR with 
reverse transcription (RT-qPCR) analysis of selected defence-related 
genes (Extended Data Fig. 3b), showed a high degree of concordance 
between the genes activated or repressed by protease IV and genes pre- 
viously shown to be regulated by flg22 or oligogalacturonides in seedlings* 
(Pearson correlation coefficients of 0.899 and 0.864 for protease-IV- 
treated versus flg22 and oligogalacturonides, respectively). 

Importantly, protease IV variants containing alanine substitutions 
at the proteolytic catalytic triad site (PrpL"”*“, PrpL?!?*“, PrpL‘1°84), 
which exhibit no detectable proteolytic activity*, were impaired for MAPK 
activation (Fig. 1f), defence gene induction, and oxidative burst elici- 
tation (Extended Data Fig. 4a, d). Treatment of protease IV with the 
protease inhibitor TLCK (Fig. 1g and Extended Data Fig. 4b, d) or with 
heat (Fig. 1c and Extended Data Fig. 4c) also resulted in a loss of elic- 
itation ability. 

The closest homologue of P. aeruginosa protease IV in sequenced 
bacterial genomes is encoded by the argC gene of Xanthomonas cam- 
pestris, a bona fide plant pathogen (Extended Data Fig. 5a). Purified 
His-tagged ArgC protease exhibited protease activity in vitro and trig- 
gered the activation of MPK3 and MPK6 that is dependent on ArgC 
protease activity (Fig. 1g). 

We noticed that there is a high rate of naturally occurring null mutations 
in the Xanthomonas argC gene (8 out of 22 total alleles in sequenced 
Xanthomonas genomes; Extended Data Fig. 5b-d), suggesting 
that argC is probably under negative selection. Consistent with the 
sequence data, the culture filtrate of X. campestris pv. raphani strain 
1946, from which the functional argC gene was cloned, activated the 
CYP71A12pro:GUS reporter, whereas culture filtrates from two X. campestris 
pv. campestris strains (8004 and BP 109), which contain presumptive 
argC null frameshift mutations, failed to activate it (Extended Data 
Fig. 5e). We complemented the null argC mutant in strain 8004 (Xcc 
8004) with the functional argC gene from strain 1946 (8004/argC) (Ex- 
tended Data Fig. 5e). Consistent with ArgC-mediated induction of a 
host immune response during an infection in a mature plant, the growth 
of 8004/argC in Brassica oleracea (broccoli), a natural host of X. cam- 
pestris, was reduced about sixfold compared with the 8004/vector con- 
trol (Fig. 1h). The expression of haemagglutinin (HA)-tagged ArgC was 
readily detected in broccoli leaves infected with 8004/argC (Extended 
Data Fig. 5e), indicating that ArgC is synthesized during infection. 

Next, we sought to investigate the mechanism by which protease IV 
activates an immune response in Arabidopsis. Previous studies have 
shown that G proteins play a role in microbe-associated molecular pat- 
tern molecule-mediated responses®. In the case of protease IV, we found 
reduced expression of defence-related genes in gα or gβ mutants (and 
in a gγ1gγ2 double mutant), reduced levels of the oxidative burst in a gα 
mutant and a gαβ double mutant, reduced MPK3 and MPK6 activation, 
and reduced protection against P. syringae infection in a gαβ double 
mutant (Fig. 2a–c and Extended Data Fig. 6a, b). The induction of 
CYP71A12 and activation of MPK3 and MPK6 by X. campestris ArgC 
were also diminished in the G-protein mutants, similar to the pattern 
observed for protease IV (Fig. 2a, b). In contrast to protease IV and 
ArgC, in the case of flg22, defence gene expression was only reduced in 
gβ and gαβ double mutants, the oxidative burst was more severely 
affected in a gβ mutant than in a gα mutant, protection against 
P. syringae was only modestly affected in a gαβ double mutant, and the 
activation of MAPKs was not affected in any of the G-protein mutants 
(Fig. 2a–c and Extended Data Fig. 6b). These data show that G-protein 
signalling is required to activate downstream MAPKs in response to 
protease IV and ArgC, but not flg22 (Fig. 2a), and that G proteins play 
different roles in canonical microbe-associated molecular pattern 
molecule and protease-mediated signalling pathways. 

Figure 2 | Protease-mediated defence responses are coupled to G-protein 
signalling. a, Western blot depicting activation of MAPKs by PrpL, ArgC, or 
flg22 in wild-type Col-0 or G-protein T-DNA mutants. The same molecular 
mass region of the blot is shown as in Fig. 1b. b, Induction of defence-related 
gene expression by PrpL, ArgC, or flg22 in wild-type Col-0 or G-protein 
T-DNA mutants measured by RT-qPCR. c, Protection of 4-week-old wild-type 
Col-0 or gαβ double mutant leaves from P. syringae pv. tomato strain DC3000 
infection by pre-infiltrated PrpL or flg22; gpa1-4 is a gα mutant, agb1-2 is 
a gβ mutant, and gpa1-4/agb1-2 is a gαβ double mutant. Data represent 
mean ± s.d.; n = 3 biological replicates with each experiment containing eight 
seedlings (b) and n = 10 leaves from five plants (c); *P < 0.05; **P < 0.01; 
***P < 0.001, Student’s t-test versus Col-0 (b) and versus mock (c). 

In a search of potential signalling components that could link the 
heterotrimeric G-protein complex to downstream MAPK cascades, we 
considered the conserved scaffold protein RACK1 (ref. 7). The rationale 
was that RACK1 shares about 25% amino-acid sequence identity 
with Gβ and like Gβ has a seven-bladed β-propeller structure’, 
interacts with Gβ in metazoans*, and functions in innate immune signalling 
in rice’. There are three RACK1 homologues in Arabidopsis: RACK1A, 
1B, and 1C, which share about 90% amino-acid sequence identity’®. 

We used three methods to determine whether Arabidopsis RACK1 
proteins interact with G proteins and MAPKs. In a bimolecular fluor- 
escence complementation (BiFC) assay in Nicotiana benthamiana leaves, 
RACK1A, RACK1B, and RACK1C interacted with Gβ, MEKK1(K361M), 
MKK4, MKK5, MPK3, and MPK6, but not Gα or MPK4 (Extended 
Data Fig. 7a). The kinase-inactive version of MEKK1, MEKK1(K361M), 
was used in this experiment because the auto-activation of native MEKK1 
destabilizes its interaction with RACK1 (data not shown). MEKK1, 
MKK4/5, and MPK3/6 are the Arabidopsis MAPK kinase kinase (MA 
PKKK), MAPK kinases (MAPKKs), and MAPKs, respectively, that 
were proposed to constitute an MAPK-signalling cascade in the flg22/ 
FLS2 signalling pathway'’. Similar results to those obtained with the 
BiFC assay in N. benthamiana were obtained with BiFC and split fire- 
fly luciferase complementation (SFLC) assays for RACK1A interactors 
in Arabidopsis protoplasts (Extended Data Fig. 7b, c). The interaction 
between RACK] proteins and MPK3/6, but not MPK4, is consistent 
with the data in Fig. 1b, showing that MPK6 and MPK3, but not MPK4, 
are strongly activated after protease IV treatment. 

In co-immunoprecipitation experiments in Arabidopsis mesophyll protoplasts
using Flag-tagged RACK1 proteins as the bait and HA-tagged
Figure 3 | RACK1A interacts with Gβ and MAPKs. a–f, Co-immunoprecipitation
(Co-IP) assays in Arabidopsis protoplasts. Protoplasts
were treated with 100 nM purified PrpL for 15 min. Target proteins were
detected in western blots using anti-HA or anti-Flag antibodies. Numbers on
the left axis of blots represent marker size (molecular mass in kilodaltons).


Gβ subunit as the prey, we observed binding between all three Arabidopsis
RACK1 proteins and Gβ (Fig. 3a and Extended Data Fig. 7d).
In contrast to Gβ, HA-tagged Gα was not pulled down by Flag-tagged
RACK1 proteins (Fig. 3b and Extended Data Fig. 7d). In the
co-immunoprecipitation experiments, the interaction of Gβ with RACK1A
was not dependent on Gα, because the interaction was still present in the
gαβ double mutant (Extended Data Fig. 7e). Finally, consistent with the
BiFC and SFLC assays, HA-tagged MEKK1(K361M), MKK5, MPK3,
and MPK6 all co-immunoprecipitated with Flag-tagged RACK1A,
whereas MPK4 did not under the same conditions (Fig. 3c–f). The
amounts of the MAPKK and MAPKs that were pulled down by
RACK1A in the co-immunoprecipitation experiments clearly decreased
in the presence of protease IV (Fig. 3d–f), suggesting that protease IV
releases the activated MAPKs from the RACK1–MAPK cascade complex
to execute their downstream cellular functions. In the case of the
MAPKKK MEKK1, we also identified endogenous RACK1 proteins by
mass spectrometric analysis as binding partners of MEKK1(K361M)
(Extended Data Fig. 7f) in a transgenic line in which Flag-tagged
MEKK1(K361M) is expressed under the control of the 3.9-kilobase (kb)
MEKK1 native promoter in a mekk1 null mutant background.

To confirm the physiological relevance of the observed interactions
between RACK1 and MAPK cascade components (Fig. 3 and Extended
Data Fig. 7), we tested a variety of loss-of-function MAPK mutants and
knockdowns. We found that the activation of the defence-related genes
WRKY30 and WRKY33 by protease IV was almost completely blocked
in two independent mpk3,6-es transgenic lines in which MPK3 is silenced
with an oestradiol-inducible MPK3-RNA interference (RNAi) construct
in a null mpk6 mutant background (Extended Data Fig. 8a). We
also found that both protease-IV-triggered MPK3/6 activation and
WRKY30 and WRKY33 gene induction were disrupted in mkk4,5-es
transgenic lines (Extended Data Fig. 8a, b), which utilize a single
oestradiol-inducible RNAi construct to target both MKK4 and MKK5
messenger RNAs (mRNAs). Finally, we observed a significant decrease
in protease-IV-triggered induction of WRKY30 and WRKY33 mRNA
accumulation in two mekk1 mutants: an mekk1 null mutant and the
mekk1 null mutant complemented with an MEKK1(K361M) construct
(mekk1/pMEKK1::MEKK1(K361M)) (Extended Data Fig. 8c, d). As previously
reported, MEKK1(K361M), which is deficient in kinase activity,
rescues the severe growth defect of an mekk1 null mutant¹². In contrast
to the mkk4,5 knockdown lines, we did not consistently observe a decreased
level of protease-IV-triggered MPK3/6 phosphorylation in either
of the mekk1 mutants (Extended Data Fig. 8e). One explanation for the
partial decrease in WRKY gene induction but not in MPK3/6 phosphorylation
in the mekk1 mutants is that multiple MAPKKKs¹³ function
additively to activate MPK3/6 but that the phosphorylation assay
is not sensitive enough to detect a partial loss of MAPKKK activity.

Obtaining genetic evidence that RACK1 is required for protease-mediated
signalling is challenging because of the functional redundancy
of the three RACK1 proteins in Arabidopsis. Transfer-DNA (tDNA)
mutants corresponding to insertions in individual rack1 genes did not
show any decrease in protease-IV- or flg22-activated MAPK levels (Extended
Data Fig. 9a), and only moderate decreases in protease-IV- but
not flg22-triggered defence gene induction (Extended Data Fig. 9b). Because
rack1a rack1b rack1c triple null mutants have a dwarf phenotype
and do not set seeds¹⁴, we generated two independent transgenic lines,
amiR-rack1-es1 and amiR-rack1-es2, which express a previously
described artificial microRNA (amiR-RACK1-4)¹⁵ under the control of
an oestradiol-inducible promoter. These transgenic lines showed dramatically
decreased transcript levels of all three rack1 genes following
oestradiol treatment (Extended Data Fig. 9c). Following protease IV or
ArgC treatment, amiR-rack1-es1 and amiR-rack1-es2 seedlings that
had been induced with oestradiol exhibited markedly decreased levels
of activated MPK3 and MPK6 (Fig. 4a). Protoplasts transfected with
constitutively expressed amiR-RACK1-4 also showed reduced levels
of protease-IV-mediated MPK3 and MPK6 activation (Extended Data
Figure 4 | Transiently silencing all three rack1 genes abrogates protease- but not
flg22-mediated responses. a, Three-day-old wild-type Col-0 and
transgenic Arabidopsis seedlings from two independent amiR-rack1-es lines
were treated with oestradiol to activate expression of the artificial microRNA
constructs and then 2 days later were treated with PrpL, ArgC or flg22 and
harvested for the MAPK phosphorylation assay. The same molecular mass
region of the western blot is shown as in Fig. 1b. b, Seedlings were treated
with oestradiol followed by PrpL, ArgC or flg22 as in panel a and then harvested
for RT-qPCR analysis of CYP71A12 transcript levels. Water-treated Col-0
was used as a normalization control. c, Protection of 4-week-old wild-type
Col-0 and transgenic amiR-rack1-es1 or amiR-rack1-es2 plants from
P. syringae pv. tomato strain DC3000 infection mediated by PrpL or flg22 24 h
after treatment with oestradiol. Data represent mean ± s.d.; n = 3 biological
replicates with each experiment containing 12 seedlings (b) and n = 10 leaves
from five plants (c); **P < 0.01; ***P < 0.001, Student’s t-test versus
Col-0 (b) and versus mock (c). d, A model of the novel protease-activated innate
immune signalling pathway in Arabidopsis.
Fig. 9d, e). Similarly, knockdown of the rack1 genes blocked protease-IV- or
ArgC-mediated defence gene induction (Fig. 4b) and protease-IV-mediated
protection against P. syringae infection (Fig. 4c). In contrast
to protease IV and ArgC, flg22-mediated activation of MAPKs, defence
gene expression and protection against P. syringae were not affected
by knockdown of the rack1 genes (Fig. 4a–c and Extended Data
Fig. 9e). These data are consistent with the conclusion that RACK1
proteins function in the protease IV and ArgC signalling pathway but
not the flg22 pathway.

The RACK1 proteins studied here are the first MAPK cascade scaffolding
proteins discovered for the large family of plant genes encoding
MAPK cascade components. In yeast, the scaffolding protein Ste5 links
an MAPK cascade to G-protein signalling in the mating pathway that is
mediated by G-protein-coupled receptor stimulation by yeast pheromone¹⁶.
In mammals, the scaffolding protein β-arrestin 2 brings MAPK
cascade activity under the control of upstream G-protein-coupled
receptors¹⁶. However, since plants do not have canonical G-protein-coupled
receptors or orthologues of Ste5 and β-arrestin⁶,¹⁶, our data
suggest that the linkage of G proteins to MAPKs via RACK1 is mechanistically
distinct from G-protein signalling in metazoans and yeast.

The protease-activated signalling pathway is summarized in the model 
shown in Fig. 4d. It remains to be determined whether the cleavage of 
protein targets by protease IV directly or indirectly activates down- 
stream responses. In the latter possibility, pathogen-secreted proteases 
could release host polypeptides that function as damage-associated 
molecular patterns which are subsequently recognized by correspond- 
ing immune receptors. In either case, an evolutionary and physiological 
interpretation of our findings is that plants evolved a new surveillance 
system to recognize and respond to pathogen-encoded proteases that 
disrupt host homeostasis via their proteolytic activity. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 16 June 2014; accepted 15 January 2015. 
Published online 2 March 2015. 


1. Tena, G., Boudsocq, M. & Sheen, J. Protein kinase signaling networks in plant 
innate immunity. Curr. Opin. Plant Biol. 14, 519-529 (2011). 

2. Arthur, J. S. & Ley, S. C. Mitogen-activated protein kinases in innate immunity. 
Nature Rev. Immunol. 13, 679-692 (2013). 


216 | NATURE | VOL 521 | 14 MAY 2015


3. Millet, Y. A. et al. Innate immune responses activated in Arabidopsis roots by 
microbe-associated molecular patterns. Plant Cell 22, 973-990 (2010). 

4. Denoux, C. et al. Activation of defense response pathways by OGs and flg22 
elicitors in Arabidopsis seedlings. Mol. Plant 1, 423-445 (2008). 

5. Traidej, M., Marquart, M. E., Caballero, A. R., Thibodeaux, B. A. & O’Callaghan, R. J. 
Identification of the active site residues of Pseudomonas aeruginosa protease IV. 
J. Biol. Chem. 278, 2549-2553 (2003). 

6. Urano, D., Chen, J.-G., Botella, J. R. & Jones, A. M. Heterotrimeric G protein signalling
in the plant kingdom. Open Biol. 3, 120186 (2013).

7. Ullah, H. et al. Structure of a signal transduction regulator, RACK1, from Arabidopsis
thaliana. Protein Sci. 17, 1771-1780 (2008).

8. Dell, E. J. et al. The βγ subunit of heterotrimeric G proteins interacts with RACK1 and
two other WD repeat proteins. J. Biol. Chem. 277, 49888-49895 (2002).

9. Nakashima, A. et al. RACK1 functions in rice innate immunity by interacting with
the Rac1 immune complex. Plant Cell 20, 2265-2279 (2008).

10. Chen, J.-G. et al. RACK1 mediates multiple hormone responsiveness and
developmental processes in Arabidopsis. J. Exp. Bot. 57, 2697-2708 (2006).

11. Asai, T. et al. MAP kinase signalling cascade in Arabidopsis innate immunity. Nature
415, 977-983 (2002).

12. Suarez-Rodriguez, M. C. et al. MEKK1 is required for flg22-induced MPK4
activation in Arabidopsis plants. Plant Physiol. 143, 661-669 (2007).

13. MAPK Group (Ichimura, K. et al.). Mitogen-activated protein kinase cascades in
plants: a new nomenclature. Trends Plant Sci. 7, 301-308 (2002).

14. Guo, J. & Chen, J.-G. RACK1 genes regulate plant development with unequal
genetic redundancy in Arabidopsis. BMC Plant Biol. 8, 108 (2008).

15. Li, J.-F. et al. Comprehensive protein-based artificial microRNA screens for
effective gene silencing in plants. Plant Cell 25, 1507-1522 (2013).

16. Witzel, F., Maddison, L. & Blüthgen, N. How scaffolds shape MAPK signaling: what
we know and opportunities for systems approaches. Front. Physiol. 3, 475 (2012).


Acknowledgements We thank G. Tena for generating the mekk1/pMEKK1::MEKK1(K361M)
transgenic line, Y. Zhang for the summ1-1 mutant,
M. C. Suarez-Rodriguez and P. J. Krysan for discussion, the Arabidopsis Biological
Resource Center for tDNA insertion lines, and M. Curtis and U. Grossniklaus for the
oestradiol-inducible binary vector. We thank S. Lory for P. aeruginosa PAO ADD1976,
and M. B. Mudgett for pVSP61. We thank N. Clay, X. Dong, S. Somerville, and Ausubel
laboratory members for reading the manuscript. This work was supported by Natural
Sciences and Engineering Research Council of Canada and Banting Postdoctoral
Fellowships awarded to Z.C., National Science Foundation grants MCB-0519898 and
IOS-0929226 and National Institutes of Health grants R37-GM48707 and P30
DK040561 to F.M.A., and National Science Foundation grant IOS-0618292 and
National Institutes of Health grant R01-GM70567 to J.S.


Author Contributions Z.C., J.-F.L., J.S., and F.M.A. designed experiments, Z.C., J.-F.L.,
Y.N., X.-C.Z., O.Z.W., Y.X., S.D., Y.M., and J.B. performed experiments, Z.C., J.-F.L., B.J.M.,
J.S., and F.M.A. wrote the manuscript.


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. 
Correspondence and requests for materials should be addressed to
F.M.A. (ausubel@molbio.mgh.harvard.edu).


©2015 Macmillan Publishers Limited. All rights reserved 


METHODS 


No statistical methods were used to predetermine sample size. 

Bacterial strains. P. aeruginosa strains used in this work were wild type and mutants
of UCBPP-PA14 (refs 17, 18) and PAO ADD1976 (ref. 19). The latter strain
carries the chromosomally incorporated gene for T7 RNA polymerase under the
control of the lac repressor and was used for production of His-tagged PrpL and
His-tagged ArgC. Xanthomonas campestris strains were described previously²⁰.

A PA14/ΔprpL in-frame deletion mutant was constructed using a method described
previously²¹ that employed a sequence that contained regions immediately
flanking the coding sequence of the prpL gene. This fragment was generated by a
standard three-step PCR protocol using Phusion DNA polymerase (New England
Biolabs) and then cloned into the BamHI and HindIII sites of pEX18Ap²², resulting
in plasmid pEX18-ΔprpL. Plasmid pEX18-ΔprpL was used to introduce the
deleted region into the wild-type PA14 genome by homologous recombination.
Escherichia coli strain SM10 λpir was used for triparental mating²³.

For the purification of His-tagged protease IV or ArgC, the P. aeruginosa PA14
prpL gene or the X. campestris strain 1946 argC gene was cloned into the EcoRI and
XhoI sites of pETP30 (ref. 24), creating plasmids pETP-prpL or pETP-argC, which
encode 6× His-tagged PrpL and 6× His-tagged ArgC, respectively. The resulting
plasmids were transformed into P. aeruginosa PAO ADD1976 by electroporation²⁵
to generate the strains ADD/pETP-prpL or ADD/pETP-argC for purification of
His-tagged protease IV or His-tagged ArgC, respectively.

For argC complementation in Xanthomonas, the X. campestris strain 1946 argC
gene was cloned into the BamHI site of pVSP61 (ref. 26), creating plasmid pVSP61-argC.
An HA tag was incorporated at the carboxy (C)-terminal end of the argC gene to
detect the complemented protein. The resulting plasmid and empty pVSP61 vector
were transformed into X. campestris strain 8004 by triparental conjugation²⁷.

Antibiotics were supplemented as needed: ampicillin or carbenicillin, 50 µg ml⁻¹
for E. coli or 300 µg ml⁻¹ for P. aeruginosa; kanamycin, 50 µg ml⁻¹ for E. coli and
Xanthomonas campestris or 200 µg ml⁻¹ for P. aeruginosa; and rifampicin, 100 µg ml⁻¹.
Construction of Arabidopsis transgenic lines. Construction of amiR-rack1-es 
transgenic lines and the mekk1/pMEKK1::MEKK1(K361M) transgenic line was 
performed as follows: the BamHI/PstI fragment of pre-amiR-RACK1-4 (ref. 15) 
was inserted between the oestradiol-inducible promoter” and the NOS terminator 
in a modified pUC119-RCS vector**. The pre-amiR-RACK1-4 expression cassette 
was then cut out by AscI digestion and inserted into AscI-digested binary vector 
pFGC19-XVE-RCS”*, which expresses the XVE transcriptional activator” under 
the 35S promoter, to obtain pFGC-EST-RACK1. This latter plasmid was introduced 
into Agrobacterium tumefaciens GV3101 cells by electroporation, and GV3101/ 
pFGC-EST-RACK1 was used to generate transgenic Arabidopsis plants with in- 
ducible amiR-RACK1 expression using the floral dip technique*’. To generate 
mekk1/pMEKK1::MEKK1(K361M) transgenic Arabidopsis, an ~9.4 kb MEKK1
genomic fragment was used to complement a mekk1 null mutant (Salk_052557). 
This genomic fragment contains an ~3.9 kb promoter sequence upstream of the 
start codon, an ‘AAGG’ to ‘ATGG’ mutation in exon 2 (corresponding to K361M 
mutation in MEKK1) to disrupt MEKK1 kinase activity, and a double Flag-tag 
coding sequence upstream of the stop codon. 

Fractionation of the PA14 secretome. One litre of PA14 cells grown in M9 minimal
medium (6.8 g l⁻¹ Na₂HPO₄, 3 g l⁻¹ KH₂PO₄, 0.5 g l⁻¹ NaCl, 1 g l⁻¹ NH₄Cl,
2 mM MgSO₄, 0.1 mM CaCl₂, 10 µM FeCl₃, 0.4% glucose, 10 mg l⁻¹ thiamine) was
centrifuged at 20,000g at 4 °C for 30 min and the pellet was discarded. The supernatant
was filtered through a 0.22 µm low protein-binding filter (Corning). Secreted
PA14 proteins in the filtrate were precipitated with ammonium sulphate (85% saturation)
at 4 °C overnight, followed by centrifugation at 20,000g at 4 °C for 1 h.
The pellet was resuspended in 30 ml buffer A (20 mM Tris, pH 8.8), concentrated
to 150 µl using a Centricon Plus-70 filter (Millipore) to remove the excess ammonium
sulphate, and diluted again into 10 ml buffer A. The protein sample was loaded onto
a 1-ml DEAE anion-exchange chromatography column (GE Healthcare) that was
washed with buffer B (20 mM Tris, pH 8.8, 1 M NaCl) and equilibrated with buffer
A. Proteins were separated into 1-ml fractions with a linear gradient of buffer B
(0–60% within 20 column volumes). The fractionation was performed at 4 °C with a
flow rate of 1 ml min⁻¹.
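
The gradient parameters given above (1-ml column, 0–60% buffer B over 20 column volumes, buffer B containing 1 M NaCl, 1-ml fractions) imply a nominal salt concentration for each fraction. The following sketch is not part of the original protocol; it assumes an ideal linear gradient with no dead volume or mixing delay, and the function name and defaults are illustrative.

```python
def fraction_nacl_mM(fraction_index, column_volume_ml=1.0, gradient_cv=20,
                     max_percent_b=60.0, buffer_b_nacl_mM=1000.0,
                     fraction_ml=1.0):
    """Nominal NaCl concentration (mM) at the midpoint of a fraction.

    fraction_index is 1-based. Assumes an ideal linear gradient from
    0% to max_percent_b buffer B over gradient_cv column volumes.
    """
    gradient_volume_ml = column_volume_ml * gradient_cv
    midpoint_ml = (fraction_index - 0.5) * fraction_ml
    if not 0 < midpoint_ml <= gradient_volume_ml:
        raise ValueError("fraction lies outside the gradient")
    # Percentage of buffer B rises linearly with eluted volume.
    percent_b = max_percent_b * midpoint_ml / gradient_volume_ml
    return buffer_b_nacl_mM * percent_b / 100.0


if __name__ == "__main__":
    for i in (1, 10, 20):
        print(f"fraction {i}: ~{fraction_nacl_mM(i):.0f} mM NaCl")
```

Under these assumptions the first fraction is centred near 15 mM NaCl and the twentieth near 585 mM; real elution profiles will lag behind these values by the system's delay volume.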

Purification of P. aeruginosa protease IV and X. campestris ArgC. Secreted proteins
from ADD1976/pETP-prpL were precipitated as described above and resuspended
in lysis buffer (50 mM NaH₂PO₄, 300 mM NaCl, 10 mM imidazole, pH 8.0).
The sample was loaded onto a 5-ml HisTrap Affinity Column (GE Healthcare) and
the 6× His-tagged PA14 protease IV was purified according to the manufacturer’s
instructions. The eluted protease IV was concentrated to 150 µl and immediately
subjected to a Superdex 200 gel filtration column (GE Healthcare). Purified protease IV
was exchanged into M9 minimal medium and filter-sterilized using a 0.22 µm
low protein-binding filter (Millipore). The concentration of the purified protease IV
was adjusted to 20 µM, aliquoted, and stored at −80 °C before being used for plant
treatments. X. campestris protease ArgC was purified using the same protocol.
Protease assay. The protease activity assay of protease IV and its homologue ArgC 
was determined as previously described*’. Protease IV and ArgC were inactivated 
by TLCK as previously described*'. 

Plant growth. Seeds were sterilized in 20% bleach for 2 min and washed three times
with sterile water. Seedlings were grown in liquid MS medium (Murashige and
Skoog basal medium with vitamins from Phytotechnology Laboratories supplemented
with 0.5 g l⁻¹ MES hydrate and 0.5% sucrose at pH 5.7) in either 24-well
assay plates (BD Falcon) (eight seeds and 0.5 ml medium per well) for MAPK assays,
microarray and RT-qPCR analysis, callose induction and GUS expression, or 96-well
plates (Greiner Bio-One) (one seed and 0.2 ml medium per well) for oxidative
burst measurements. Plates were sealed with Micropore tape and placed on
grid-like shelves over water trays on a Floralight cart in a plant growth chamber
for 10 days at 21 °C with 75% relative humidity under 16 h of daylight
(65–70 µE m⁻² s⁻¹). The medium in 24-well plates was exchanged for fresh medium on
day 8, whereas the medium in 96-well plates was exchanged for sterile water on day 9.
Elicitor treatments. The synthetic peptide flg22 was synthesized by Genscript. 
Experimentally determined optimal concentrations of protease IV were as follows: 
20 nM for oxidative burst measurements, microarray and RT-qPCR analyses; 40 nM 
for MAPK activation; 100 nM for GUS expression; 500 nM for callose elicitation 
and the infection protection assay. For direct comparison, the same concentrations
of flg22 and protease IV or ArgC were used in the same assays. Ten-day-old
seedlings were treated with different elicitors for the following times unless otherwise
specified: 6 h for GUS assays in reporter line CYP71A12pro:GUS; 10 min for
MAPK activation assays; 1 h or 6 h for RT-qPCR analysis of selected genes; and
18 h for callose induction.

Transient silencing of MAPK or MAPKK genes in transgenic plants. In two
independent mpk3,6-es transgenic lines, MPK3 was silenced with an oestradiol-inducible
MPK3-RNAi construct in a null mpk6 mutant (Salk_062471) background.
In two mkk4,5-es transgenic lines, a single oestradiol-inducible RNAi construct was
used to target both MKK4 and MKK5 mRNAs. Details of the construction of the
mpk3,6-es and mkk4,5-es transgenic lines will be described elsewhere. The transgenic
and control plants were grown in MS medium in a 24-well plate as described
above for 4 days. Then the medium was changed to MS medium containing 10 µM
oestradiol (Sigma, 100 mM stock in dimethylsulphoxide (DMSO)). After exposure
to oestradiol for 3 days, the seedlings were treated with water or 40 nM purified
protease IV for 10 min (for MAPK assays) or with water or 20 nM purified protease IV
for 1 h (for RT-qPCR assays).
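
As a worked example of the oestradiol dilution above (10 µM working concentration from a 100 mM stock), the sketch below applies C1·V1 = C2·V2. It is illustrative only: the helper name is hypothetical, and the 12 ml batch volume is an assumption corresponding to one full 24-well plate at 0.5 ml per well.

```python
def stock_volume_ul(final_conc_uM, final_volume_ml, stock_conc_mM):
    """Volume of stock solution (µl) required, from C1*V1 = C2*V2."""
    stock_conc_uM = stock_conc_mM * 1000.0      # mM -> µM
    final_volume_ul = final_volume_ml * 1000.0  # ml -> µl
    return final_conc_uM * final_volume_ul / stock_conc_uM


if __name__ == "__main__":
    # Assumed batch: 24 wells x 0.5 ml = 12 ml of MS medium.
    vol = stock_volume_ul(final_conc_uM=10, final_volume_ml=12, stock_conc_mM=100)
    print(f"Add {vol:.2f} µl of 100 mM oestradiol stock to 12 ml medium")
```

This corresponds to a 1:10,000 dilution, so the final DMSO concentration in the medium is about 0.01% (v/v).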

Transient silencing of rack1 genes in protoplasts and transgenic plants. Mesophyll
protoplasts isolated from leaves of 4-week-old Arabidopsis plants (4 × 10⁴
cells in 200 µl) were transfected with 40 µg (20 µl) of amiR-RACK1-4 construct or
empty artificial microRNA expression vector¹⁵ as a control. After 24 h of expression,
100 nM flg22 or 100 nM purified protease IV was added to the protoplasts, followed
by incubation for 10 min before the cells were harvested for MAPK assays
and rack1 gene silencing confirmation by RT-qPCR.

For oestradiol-induced rack1 silencing in transgenic amiR-rack1-es lines, the
wild-type Col-0 and transgenic plants were grown in MS medium in a 24-well plate
as described above for 3 days. Then the medium was changed to MS medium containing
10 µM oestradiol (Sigma, 100 mM stock in DMSO). After exposure to oestradiol
for 2 days, the seedlings were treated with water or with 40 nM flg22 or 40 nM
purified protease IV for 10 min (MAPK assay) or 20 nM flg22 or 20 nM purified
protease IV for 6 h (RT-qPCR measuring transcript levels of CYP71A12, GST6, and
the three rack1 genes). For the protease-IV-mediated protection assay against
P. syringae DC3000, 20 µM oestradiol was infiltrated into 4-week-old control Col-0
and transgenic amiR-rack1-es1 and amiR-rack1-es2 leaves 24 h before the mock
treatment or treatment with 500 nM purified protease IV.
Mutant seed stocks. Transfer-DNA insertion lines gpa1-4 (CS6534), agb1-2
(CS6536), agg1-1c (CS16550), agg2-1 (SALK_022447), gpa1-4/agb1-2 (CS6535),
agg1-1c/agg2-1 (CS16551), mekk1 (SALK_052557), rack1a-3 (CS862351),
rack1b-2 (SALK_145920), rack1b-3 (CS863092), rack1c-2 (SALK_017913), and rack1c-3
(SALK_001973) were obtained from the Arabidopsis Biological Resource Center.
GUS histochemical assay. After treatment with 100 nM flg22 or 100 nM purified
protease IV for 6 h, plants were washed with 50 mM sodium phosphate (pH 7) and
0.5 ml of GUS substrate solution (50 mM sodium phosphate, pH 7, 10 mM EDTA,
0.5 mM K₄[Fe(CN)₆], 0.5 mM K₃[Fe(CN)₆], 0.5 mM X-Gluc, and 0.1% v/v Triton
X-100) was added to each well. The plants were vacuum-infiltrated for 5 min and
then incubated at 37 °C for 4 h. Tissues were fixed with a 3:1 ethanol:acetic acid
solution at 4 °C overnight and placed in 95% ethanol. Tissues were cleared in lactic
acid and then examined using a Discovery V12 microscope (Zeiss). For the screen
of PA14 secretome fractions, 100 µl of buffer A (20 mM Tris, pH 8.8) or different
DEAE fractions were added to each well.
MAPK activity. Total proteins in seedling or protoplast lysates were resolved on a
10% SDS-polyacrylamide gel electrophoresis (SDS-PAGE) gel and transferred to 
a polyvinylidene difluoride membrane. Western blot analysis was conducted by 
using anti-phospho ERK antibodies (Cell Signaling) as the primary antibody at 
1:10,000 dilution in 5% BSA and horseradish peroxidase (HRP)-conjugated anti- 
rabbit antibodies as the secondary antibody at 1:10,000 dilution in 5% non-fat milk. 
The immunoblot signal was visualized with a SuperSignal West Femto kit (Thermo 
Scientific). 

Oxidative burst measurement. H₂O₂ was detected using a luminol-HRP-based
chemiluminescence assay. A 10 mg ml⁻¹ 500× HRP (Sigma-Aldrich) stock solution
was prepared by dissolving 10 mg HRP in water. A 20 mg ml⁻¹ 500× luminol
(Sigma-Aldrich) stock solution was prepared by dissolving 20 mg luminol in 100 mM
KOH. For each elicitor, a master reaction mixture was prepared by diluting individual
elicitor, HRP, and luminol stocks with water. The plates were kept in the dark for
1 h before elicitation. The following procedures were performed in the dark. Liquid
was removed at the end of the 1-h pre-treatment and 200 µl of master reaction mixture
was added into each well. Plates were immediately placed into a 96-well
scintillation counter (1450 Microbeta Wallac TriLux Scintillation/Luminescence
counter) and light emission was monitored. Every plate
was read for about 30 cycles. Kinetics of H₂O₂ production was determined by plotting
the average chemiluminescence counts from all the seedlings under the same
condition over the reading period. Every time point is the mean value of 16 seedlings.
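
The averaging step described above can be sketched as follows. This is a minimal illustration rather than the authors' analysis code: the function name and the list-of-lists input layout (one list of counts per seedling, one count per read cycle) are assumptions.

```python
from statistics import mean


def burst_kinetics(counts_by_seedling):
    """Mean chemiluminescence count per read cycle across seedlings.

    counts_by_seedling: list of per-seedling lists; each inner list holds
    one luminescence count per read cycle, in reading order.
    """
    cycle_counts = {len(counts) for counts in counts_by_seedling}
    if len(cycle_counts) != 1:
        raise ValueError("all seedlings must have the same number of cycles")
    # zip(*...) regroups the data by cycle instead of by seedling.
    return [mean(cycle) for cycle in zip(*counts_by_seedling)]


if __name__ == "__main__":
    # Two hypothetical seedlings read for three cycles each.
    print(burst_kinetics([[100, 400, 250], [120, 380, 270]]))
```

With 16 seedlings per condition and about 30 cycles per plate, the input would hold 16 inner lists of about 30 counts each, and the returned list gives the points of the kinetic curve.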
RNA isolation and microarray and RT-qPCR analysis. Total RNA was isolated
according to the manufacturer’s instructions using an RNeasy Plant Mini Kit (Qiagen).
DNA was removed using the DNA-free kit (Ambion), and reverse transcription
reactions were performed using an iScript cDNA synthesis kit (Bio-Rad). Complementary
DNA (cDNA) concentrations were measured using a NanoDrop instrument
(Thermo Scientific). RT-qPCR was performed using a CFX96 real-time PCR
machine (Bio-Rad) using iQ SYBR Green Supermix (Bio-Rad). The following PCR
reaction programme was used: 95 °C for 3 min followed by 50 cycles of 95 °C for
30 s and 55 °C for 30 s. Fold change was calculated relative to plants treated with
M9 buffer. Fold induction data represent the mean ± s.d., n = 3 with each containing
eight seedlings. Expression values were normalized to that of the eukaryotic
translation initiation factor 4A1 (EIF4A1). The primers used were the following:
EIF4A1 (At3g13920), 5′-GCAGTCTCTTCGTGCTGACA-3′ and 5′-TGTCATAGATCTGGTCCTTGAA-3′;
CYP71A12 (At2g30750), 5′-GATTATCACCTCGGTTCCT-3′ and 5′-CCACTAATACTTCCCAGATTA-3′;
WRKY30 (At5g24110), 5′-GCAGCTTGAGAGCAAGAATG-3′ and 5′-AGCCAAATTTCCAAGAGGAT-3′;
GST6 (At2g47730), 5′-CCATCTTCAAAGGCTGGAAC-3′ and 5′-TCGAGCTCAAAGATGGTGAA-3′;
WRKY29 (At4g23550), 5′-ATCCAACGGATCAAGAGCTG-3′ and 5′-GCGTCCGACAACAGATTCTC-3′;
WRKY33 (At2g38470), 5′-GGGAAACCCAAATCCAAGA-3′ and 5′-GTTTCCCTTCGTAGGTTGTGA-3′;
ERF1 (At3g23240), 5′-TCGGCGATTCTCAATTTTTC-3′ and 5′-ACAACCGGAGAACAACCATC-3′;
rack1a (At1g18080), 5′-GCTGAAAAGGCTGACAACAGT-3′ and 5′-GCTCCAGTTAAGGCTTGTGC-3′;
rack1b (At1g48630), 5′-TTGTTGAGGATTTGAAGGTTGA-3′ and 5′-CCAGTTCAAGCTTGTGCAGTA-3′;
rack1c (At3g18130), 5′-GAGGCAGAGAAGAATGAAGGTG-3′ and 5′-CCAGTTCAAGCTTGTGCAGTA-3′.
WRKY gene induction was measured 1 h after elicitation, whereas CYP71A12, ERF1,
and GST6 were measured 6 h after elicitor treatment.
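
The text states that expression was normalized to EIF4A1 and that fold change was calculated relative to M9-buffer-treated plants, but it does not give the formula. The sketch below assumes the widely used 2^-ΔΔCt calculation with equal amplification efficiency for target and reference genes; this is an assumption, not a method confirmed by the paper, and the function names and Ct values are hypothetical.

```python
from statistics import mean, stdev


def fold_change(ct_target_treated, ct_ref_treated,
                ct_target_control, ct_ref_control):
    """Relative expression by 2^-ddCt (assumes ~100% primer efficiency)."""
    delta_treated = ct_target_treated - ct_ref_treated    # normalize to reference
    delta_control = ct_target_control - ct_ref_control
    return 2.0 ** -(delta_treated - delta_control)


def summarise(fold_changes):
    """Mean and sample s.d. across biological replicates."""
    return mean(fold_changes), stdev(fold_changes)


if __name__ == "__main__":
    # Hypothetical Ct values for three biological replicates:
    # (target treated, EIF4A1 treated, target M9 control, EIF4A1 M9 control)
    replicates = [(22.0, 18.0, 25.0, 18.0),
                  (21.8, 18.1, 25.2, 18.0),
                  (22.3, 18.2, 25.1, 18.1)]
    folds = [fold_change(*r) for r in replicates]
    m, sd = summarise(folds)
    print(f"fold induction = {m:.1f} ± {sd:.1f} (n = {len(folds)})")
```

If primer efficiencies differ appreciably from 100%, an efficiency-corrected model would be needed instead of the fixed base of 2.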

For microarray analysis, RNA quality was assessed by checking the integrity of
RNA on an Agilent 2100 Bioanalyzer (Agilent Technologies). Target labelling was
performed according to the protocol given in the Affymetrix GeneChip 3’ IVT
Express Kit Technical Manual. Microarray hybridizations and scanning were performed
at the Genomics Core, Joslin Diabetes Center, Boston, Massachusetts. Microarray
CEL files were read into the R statistical analysis software, version 2.15.2. Arrays
were analysed together using the standard robust multi-array average procedure
as implemented in Bioconductor’s ‘affy’ package, version 1.36.1 (refs 32, 33). Fold
changes were calculated using log₂-transformed expression values by subtracting
the mean of control samples from the mean of treated samples. Microarray CEL
files were also obtained from previous studies exploring the effects of flg22 and
oligogalacturonides on gene expression⁴. These two experiments were subjected to
the robust multi-array average procedure together, but downstream analyses (for
example, fold change computations) were performed separately on the two treatments.
The microarray data have been deposited in the GEO database under
accession number GSE58518.
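
The fold-change rule stated above (mean of treated minus mean of control, on log₂-transformed values) is sketched below for a single probe set. The original analysis was carried out in R with Bioconductor's 'affy' package; Python is used here purely for illustration, and the function names and expression values are hypothetical.

```python
from statistics import mean


def log2_fold_change(treated_log2, control_log2):
    """Difference of mean log2 expression: treated minus control."""
    return mean(treated_log2) - mean(control_log2)


def linear_fold(log2_fc):
    """Convert a log2 fold change back to a linear ratio."""
    return 2.0 ** log2_fc


if __name__ == "__main__":
    # Hypothetical RMA-normalized log2 intensities for one probe set.
    treated = [10.0, 11.0]
    control = [8.0, 9.0]
    lfc = log2_fold_change(treated, control)
    print(f"log2 FC = {lfc:.2f}, linear = {linear_fold(lfc):.1f}-fold")
```

Because the subtraction is done on the log₂ scale, a difference of 1 corresponds to a twofold change and negative values indicate repression.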

Callose deposition assay. Elicitor-induced callose deposition in cotyledons of
10-day-old Arabidopsis seedlings was detected using aniline blue as described³⁴.
Eighteen hours after elicitation, seedlings were fixed under a vacuum in 3:1 ethanol:acetic
acid. The clearing solution was changed until the leaves were colourless. Tissues
were washed in 70% ethanol and then 50% ethanol for at least 2 h each time and
rehydrated in several brief H₂O washes followed by an overnight H₂O wash. Samples
were then made transparent by several minutes in a vacuum with 10% NaOH
followed by a 2-h incubation at 37 °C on a shaking platform. After several more
H₂O washes, tissues were incubated in the dark at 21 °C for at least 4 h with 0.01%
aniline blue in 150 mM K₂HPO₄ (pH 9.5). After mounting on slides in 50% glycerol,
samples were examined with a Zeiss Axioplan microscope using ultraviolet
illumination and a broadband 4',6-diamidino-2-phenylindole (DAPI) filter set
(excitation filter 390 nm; dichroic mirror 420 nm; emission filter 460 nm).
Pathogenicity assays. Arabidopsis pathogenicity assays, including infection by 
P. syringae strain DC3000 with or without pre-infiltration of protease IV or flg22, 
were performed according to previously described protocols”’. Data represent the 
mean of bacterial titres ± s.d. of ten leaf disks excised from ten leaves of five plants.
The infection protection assay was repeated three times with similar results. 

Xanthomonas pathogenicity assays in B. oleracea were performed according to 

previously described protocols** with modifications. Seeds of broccoli cultivar B. 
oleracea var. Marathon were sown in Fafard number 2 soil mix and grown in a 12-h
light (70 µE m⁻² s⁻¹) cycle at 19 °C and 60% relative humidity. Individual seedlings
were transferred to 5 cm × 5 cm pots after one week and kept at a cycle of 16-h
light (150 µE m⁻² s⁻¹) at 23 °C followed by 8-h dark at 20 °C and 70% relative
humidity. After a further 2 weeks of growth, the 3-week-old plants were used for
Xanthomonas infiltration. Fresh X. campestris overnight cultures were washed and 
adjusted to 10° cells per millilitre in 10 mM MgSO,. A standard infiltration protocol 
was used to infect 3-week-old leaves. After infection, the plants were transferred to a
growth chamber with the following conditions: 12-h light (60 µE m⁻² s⁻¹) at 28 °C
at 90% relative humidity for 2 days before being harvested for counting of colony-forming
units. Data represent the mean of bacterial titres ± s.d. of ten leaf disks
excised from ten leaves of five plants. The infection protection assay was repeated 
three times with similar results. 
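
The summary statistic reported for these assays (mean bacterial titre ± s.d. across ten leaf disks) can be sketched as follows. This is a minimal illustration rather than the authors' analysis: the colony counts and the helper name `titre_summary` are hypothetical, and summarising titres on a log10 scale is an assumption that the text does not state.

```python
import math
import statistics


def titre_summary(cfu_per_disk):
    """Return (mean, sample s.d.) of log10-transformed bacterial titres."""
    log_titres = [math.log10(count) for count in cfu_per_disk]
    return statistics.mean(log_titres), statistics.stdev(log_titres)


# Hypothetical colony-forming units recovered from ten leaf disks.
example_cfu = [2.1e5, 3.4e5, 1.8e5, 2.9e5, 4.0e5,
               2.2e5, 3.1e5, 2.6e5, 1.9e5, 3.7e5]

mean_log, sd_log = titre_summary(example_cfu)
print(f"log10 titre = {mean_log:.2f} ± {sd_log:.2f} (n = {len(example_cfu)})")
```

The sample standard deviation (n − 1 denominator) is used here because the ten disks are treated as a sample of the infected leaves.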
Co-immunoprecipitation. For co-immunoprecipitation performed in protoplasts, 
mesophyll protoplast isolation from leaves of 4-week-old Arabidopsis plants and 
polyethylene glycol (PEG)-mediated DNA transfection were performed as previously
described**. Co-immunoprecipitation was performed as described previously”’
with modifications. Briefly, 100 μg (50 μl) of PREY plasmids were used to
co-transfect 1 ml Arabidopsis mesophyll protoplasts (5 × 10° cells) with 100 μg
(50 μl) of BAIT plasmids or empty vectors. After 6 h to allow protein expression,
the cells were pelleted and lysed in 200 μl of immunoprecipitation buffer (10 mM
HEPES, pH 7.5, 150 mM NaCl, 1 mM EDTA, 10% glycerol, 1% Triton X-100, 1×
Roche EDTA-free protease inhibitor cocktail) by vigorous vortexing for 1 min. Twenty
microlitres of lysate was saved as the input fraction to ensure that the Prey proteins
were expressed equally in all samples. The rest of the lysate (180 μl) was mixed with
320 μl immunoprecipitation buffer and vigorously vortexed for 1 min. The resultant
clear lysate was centrifuged at 21,000g for 10 min at 4 °C, and the supernatant
was incubated with a 10 μl slurry of anti-Flag M2 agarose beads (Sigma) or anti-HA
magnetic beads (Pierce) for 3 h at 4 °C. The beads were washed three times with the
immunoprecipitation buffer and once with 50 mM Tris-HCl, pH 7.5. The eluate
was obtained by boiling the beads in 40 μl of SDS-PAGE loading buffer and the
presence of co-immunoprecipitated PREY proteins was detected by immunoblotting
analysis using HRP-conjugated anti-HA antibody or anti-Flag (Roche) at 1:10,000 
dilution; the immunoblot signal was visualized using a SuperSignal West Femto 
kit (Thermo Scientific). The same membrane was stripped and re-used to detect 
the comparable amounts of immunoprecipitated BAIT proteins by immunoblot. 
BiFC. For plasmids used in the split-mCherry assay, the coding sequence of the 
amino (N)-terminal fragment (mCherryN, amino acids 1-159) or the C-terminal 
fragment (mCherryC, amino acids 160-235) of mCherry was PCR amplified, 
digested by BamHI/NotI, and inserted into the same digested pAN vector, which
contained a double 35S promoter and a NOS terminator, to obtain pcCherryN and
pcCherryC plasmids. Genes for protein-protein interaction tests were inserted into
the XbaI/BamHI-digested pcCherryN or pcCherryC vectors after digestion of their
PCR products with XbaI (or SpeI, NheI if the XbaI site was present in the gene) at
the 5’ end and with BamHI (or BglII if the BamHI site was present in the gene) at
the 3’ end, allowing the expression of a chimaeric gene of interest with the coding 
sequence of mCherryN or mCherryC at the 3’ end. 

For binary plasmids used in the BiFC assay in agroinfiltrated N. benthamiana 
leaves, pFGC-RCS (kanamycin resistant) and pPZP-RCS (spectinomycin resistant) 
binary vectors were constructed by replacing the original sequences between EcoRI 
and HindIII of pFGC19 and pPZP222 with the multiple cloning site sequence from 
pUC119-RCS flanked by EcoRI and HindIII**. Subsequently, the entire expression 
cassette of ‘gene’-mCherryN was PCR amplified from protoplast expression plasmids,
digested by AscI and inserted into the AscI site of pFGC-RCS, while the entire
expression cassette containing the ‘gene’-mCherryC fusion DNA was PCR amplified
from protoplast expression plasmids, digested by AscI and inserted into the
AscI site of pPZP-RCS. A pair of pFGC-RCS and pPZP-RCS plasmids expressing a
pair of genes for protein-protein interaction tests were co-transformed into
Agrobacterium GV3101 cells by electroporation, and cells transformed with both binary
plasmids were selected by the addition of both kanamycin and spectinomycin to
the growth medium. Leaves of 4- to 5-week-old N. benthamiana plants were infiltrated
with agrobacteria (final attenuance, D600 nm = 0.01) containing constructs
expressing the mCherryN fragment fused to GPA1, AGB1, or MAPKs and the 
mCherryC fragment fused to RACK1A/B/C. The agroinfiltration experiment was 
performed as described previously”. 

Arabidopsis protoplasts 18 h after transfection and N. benthamiana leaf pieces 
2 days after agroinfiltration were imaged using a Leica DM-6000B upright fluor- 
escence microscope with phase and differential interference contrast equipped with 
a Leica FW4000 digital image-acquisition and processing system. 
SFLC. For plasmids used in the SFLC assay, the genes for protein-protein interaction
tests were inserted into the XbaI/BamHI-digested pcFLucN or pcFLucC
vectors” after digestion of their PCR products with XbaI (or SpeI, NheI if the XbaI
site was present in the gene) at the 5’ end and with BamHI (or BglII if the BamHI 
site was present in the gene) at the 3’ end, allowing the expression of a chimaeric 
gene of interest with the coding sequence of FLucN or FLucC at the 3’ end. 

SFLC experiments performed in protoplasts were performed as described previously*’.
Briefly, 10 μg (5 μl) of PREY plasmids were used to co-transfect 100 μl of
Arabidopsis mesophyll protoplasts (5 × 10° cells) with 10 μg (5 μl) of BAIT plasmids.
One microgram of UBQ10::GUS plasmid was used in each transfection as
an internal normalization control. After 6 h to allow for protein expression, the
luminescence of each sample was recorded by a GloMax-Multi microplate multimode
reader (Promega) with the integration time set as 1 s.
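
One plausible way to use the UBQ10::GUS internal control is to divide each sample's luminescence by its GUS activity and then express the result relative to a reference pair. The sketch below assumes that scheme; the text does not spell out the calculation, and the function name and all readings are hypothetical.

```python
def relative_interaction(luminescence, gus_activity,
                         ref_luminescence, ref_gus_activity):
    """Luciferase signal normalised to GUS, relative to a reference sample."""
    if gus_activity <= 0 or ref_gus_activity <= 0:
        raise ValueError("GUS activity must be positive")
    normalised = luminescence / gus_activity
    ref_normalised = ref_luminescence / ref_gus_activity
    return normalised / ref_normalised


# Hypothetical readings (arbitrary units): a test pair and a negative-control pair.
signal = relative_interaction(12000.0, 400.0, 1500.0, 500.0)
print(f"relative interaction intensity = {signal:.1f}")
```

Dividing by the GUS reading corrects for sample-to-sample differences in transfection efficiency before the split-luciferase signals are compared.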
Extended Data Figure 1 | Protease IV-triggered GUS staining in
CYP71A12pro:GUS transgenic Arabidopsis seedlings. a, Activation of
CYP71A12pro:GUS by a DEAE fraction of the PA14 secretome (left) and
purification of the eliciting activity by DEAE chromatography (right).
b, Activation of CYP71A12pro:GUS in 10-day-old seedlings by 100 nM purified
PrpL. The experiments in a and b were repeated three times with similar results. 
Extended Data Figure 2 | Trypsin does not activate MAPK cascade or elicit
an oxidative burst in Arabidopsis. a, Western blot depicting activation of
MAPKs by 40 nM flg22, or 40 nM purified PrpL, or trypsin in 10-day-old
seedlings. The same molecular mass region of the western blot is shown
as in Fig. 1b. b, Chemiluminescence assay showing elicitation of an oxidative
burst in 10-day-old seedlings by 20 nM purified PrpL or trypsin. Error bars, s.d.; 
n = 16 individual seedlings. 
Extended Data Figure 3 | Transcriptional analysis of purified protease IV.
a, Genome-wide transcriptomic profiles obtained with Affymetrix Arabidopsis
ATH1 GeneChips of 10-day-old seedlings treated with 20 nM purified PrpL
and comparison with published flg22 and oligogalacturonide responses. A
Venn diagram shows the similarity of expression behaviour (|fold change| > 2)
in response to the three treatments. b, Defence gene induction levels measured
by RT-qPCR in 10-day-old Col-0 seedlings treated with 20 nM purified PrpL
or 20 nM flg22 for 1 h (WRKY29, 30, and 33) or 6 h (GST6, ERF1, and
CYP71A12). Data represent mean ± s.d.; n = 3 biological replicates, each
containing eight seedlings.
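
The Venn comparison in panel a rests on a fold-change filter followed by set overlaps. The sketch below shows that logic under stated assumptions: expression is given as treated/untreated ratios, a gene passes if it changes more than twofold in either direction, and the gene identifiers and values are hypothetical placeholders, not data from the study.

```python
def changed(ratio, threshold=2.0):
    """True if expression rises or falls by more than `threshold`-fold."""
    return ratio > threshold or ratio < 1.0 / threshold


def responsive_genes(ratios, threshold=2.0):
    """Set of gene identifiers that pass the fold-change filter."""
    return {gene for gene, ratio in ratios.items() if changed(ratio, threshold)}


# Hypothetical treated/untreated expression ratios for the three treatments.
prpl = {"geneA": 8.0, "geneB": 3.5, "geneC": 0.3, "geneD": 1.2}
flg22 = {"geneA": 6.0, "geneB": 1.5, "geneC": 0.4, "geneD": 2.5}
ogs = {"geneA": 4.0, "geneB": 2.2, "geneC": 1.1, "geneD": 1.0}

sets = [responsive_genes(treatment) for treatment in (prpl, flg22, ogs)]
shared_by_all = set.intersection(*sets)   # centre of the Venn diagram
prpl_only = sets[0] - sets[1] - sets[2]   # region unique to PrpL
print(sorted(shared_by_all), sorted(prpl_only))
```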
Extended Data Figure 4 | Protease-IV-triggered responses are dependent
on proteolytic activity. a, Induction of defence-related genes by 20 nM
purified PrpL or inactive variants of PrpL measured by RT-qPCR. b, Induction
of defence-related genes by 20 nM purified PrpL or 20 nM flg22, or 20 nM
TLCK-treated PrpL or 20 nM TLCK-treated flg22 measured by RT-qPCR.
c, Induction of defence-related genes by 20 nM PrpL or 20 nM heat-treated
PrpL or 20 nM flg22 or 20 nM heat-treated flg22 measured by RT-qPCR.
d, Chemiluminescence assay showing elicitation of an oxidative burst by 20 nM
purified PrpL, 20 nM inactive variants of PrpL, or 20 nM TLCK-treated PrpL.
Data represent mean ± s.d.; n = 3 biological replicates with each experiment
containing eight seedlings (a-c) and n = 16 individual seedlings (d).
[Extended Data Figure 5 image: protein sequence alignment of PA14 PrpL with Xcr ArgC, and nucleotide alignments of argC from Xcr-1946, Xcc-8004 and Xcc-BP109.]
Extended Data Figure 5 | Sequence analyses of Xanthomonas argC genes.
a, The protein sequence alignment between P. aeruginosa PA14 PrpL and
X. campestris pv. raphani strain 1946 ArgC (Xcr ArgC). b-d, Three
independent presumptive null mutations in the Xanthomonas argC gene: an
insertion of G, a single nucleotide mutation, and a deletion. The extra G is
highlighted in black in b; the single nucleotide substitution is indicated by an
arrow in c; and the single base deletion is highlighted in black in d. The resulting
premature stop codons are highlighted in red. Sequences were aligned to the
argC allele in X. campestris pv. raphani strain 1946 (Xcr-1946), from which the
argC gene was cloned. X. campestris pv. campestris strains 8004 (Xcc-8004);
X. campestris pv. campestris strains BP109 (Xcc-BP109); X. fuscans subsp.
fuscans strain 4834-R (Xf-4834-R); X. campestris pv. vesicatoria (Xcv).
e, Activation of CYP71A12pro:GUS in 10-day-old seedlings by culture filtrate
from X. campestris strain Xcr-1946, Xcc-8004, or Xcc-BP109, and X. campestris
strain 8004 complemented with a functional argC gene (8004/argC) or
transformed with empty vector (8004/vector). Detection of HA-ArgC with
an anti-HA antibody. The GUS staining was repeated three times with similar
results and the representative images shown were selected from at least
three images.
Extended Data Figure 6 | G proteins are required for protease IV response.
a, Induction of CYP71A12 and GST6 gene expression by 20 nM purified PrpL
in 10-day-old wild-type Col-0, Gγ single mutants (agg1-1c and agg2-1), or a
Gγ1γ2 double mutant measured by RT-qPCR. b, Chemiluminescence assay
showing elicitation of an oxidative burst by 20 nM purified PrpL or 20 nM flg22
in wild-type Col-0 or G-protein tDNA mutants. Data represent mean ± s.d.;
n = 3 biological replicates with each containing eight seedlings (a) and n = 16
individual seedlings (b); **P < 0.01, Student’s t-test.
Extended Data Figure 7 | Interactions between RACK1 and Gβ or
MAPKs. a, Split-mCherry assay in 4-week-old Agrobacterium-infiltrated
N. benthamiana leaves. Images were pseudocoloured for visualization. Scale
bar, 100 μm. RACK1A, B, C proteins were fused with the C-terminal half of
mCherry and the potential interaction partner proteins were fused with the 
N-terminal half of mCherry. b, Split-mCherry assay in Arabidopsis protoplasts. 
RACK1A protein was fused with the C-terminal half of mCherry and the 
potential interaction partner proteins were fused with the N-terminal half of 
mCherry. Green fluorescent protein (GFP) was included in each experiment to
serve as a transfection control. Images were pseudocoloured for visualization.
Scale bar, 10 μm. c, Relative interaction intensity between RACK1A and G
proteins or MAPKs measured by SFLC. RACK1A protein was fused with the 
FLucN or FLucC to pair with G proteins or MAPKs fused with the other half 
of firefly luciferase. Both constructs were co-expressed in protoplasts for 6 h
and the complemented luciferase activity was used to relatively quantify
protein-protein interactions. UBQ10::GUS was included in each experiment
to serve as a transfection normalization control. Data represent mean ± s.d.;
n = 3 technical replicate samples. d, Protoplasts were co-transfected with
GPA1-HA or AGB1-HA and RACK1B/C-Flag or a control vector.
Co-immunoprecipitation was performed with an anti-Flag antibody. Top: the
expression of GPA1 or AGB1 protein. Middle: AGB1, but not GPA1,
co-immunoprecipitates with RACK1 proteins. Bottom: pulldown of RACK1
proteins by anti-Flag antibody. Protoplasts were treated with 100 nM purified
PrpL for 15 min. e, Co-immunoprecipitation between GPA1 or AGB1 and
RACK1A was performed in wild-type Col-0 or gaf mutant Arabidopsis
mesophyll protoplasts. Numbers on the left of blots represent marker size in
kilodaltons. f, Mass spectrophotometric analysis of endogenous proteins pulled
down by Flag-tagged MEKK1(K361M). A peptide conserved in all three
RACK1 proteins is shown. The experiments in a and b were repeated three
times with similar results. 
Extended Data Figure 8 | Protease IV-triggered defence responses in wild-type
Col-0 and MAPK mutants. a, Induction of WRKY30 and WRKY33 gene
expression by 20 nM purified PrpL in 7-day-old seedlings of wild-type Col-0 
and transgenic mpk3,6-es1/2 and mkk4,5-es1/2 plants in the absence or 
presence of oestradiol. b, Western blot depicting activation of MPK3 and 
MPK6 by 40 nM purified PrpL in 7-day-old seedlings of wild-type Col-0 and 
transgenic mkk4,5-es1 plants in the absence or presence of oestradiol. The 
same molecular mass region of the western blot is shown as in Fig. 1b.
c, Induction of WRKY30 and WRKY33 gene expression by 20 nM purified PrpL
in 10-day-old wild-type Col-0 and mekk1/pMEKK1::MEKK1(K361M) mutant
seedlings. d, Induction of WRKY30 and WRKY33 gene expression by 20 nM
purified PrpL in 4-day-old wild-type Col-0 and mekk1 null mutant seedlings. 
e, Western blot depicting activation of MPK3 and MPK6 by 40 nM purified 
PrpL in 10-day-old wild-type Col-0 and mekk1/pMEKK1::MEKK1(K361M) 
mutant seedlings or 4-day-old wild-type Col-0 and mekk1 null mutant 
seedlings. The same molecular mass region of the western blot is shown as in 
Fig. 1b. Data represent mean ± s.d.; n = 3 biological replicates with each
containing eight seedlings (a, c, d); **P < 0.01; ***P < 0.001, Student’s
t-test versus Col-0 controls. 
Extended Data Figure 9 | RACK1 proteins are required for protease IV
response. a, Western blot depicting activation of MAPKs by 40 nM purified 
PrpL or 40 nM flg22 in 5-day-old seedlings of wild-type Col-0 and individual 
rack1::tDNA insertion mutants. The same molecular mass region of the 
western blot is shown as in Fig. 1b. b, Induction of CYP71A12 by 20 nM purified 
PrpL or 20 nM flg22 in 5-day-old seedlings of wild-type Col-0 and individual 
rack1::tDNA insertion mutants. c, RT-qPCR analysis of rack1a, rack1b,
and rack1c transcript levels in the 5-day-old Col-0 or amiR-rack1-es1 and
amiR-rack1-es2 seedlings. d, RT-qPCR analysis of rack1a, rack1b, and rack1c
transcript levels in Arabidopsis protoplasts transfected with amiR-RACK1-4 or 
artificial microRNA control. e, Western blot depicting activation of MAPKs by 
40 nM purified PrpL or 40 nM flg22 in Arabidopsis protoplasts transfected 
with amiR-RACK1-4 or artificial microRNA control. The same molecular mass 
region of the western blot is shown as in Fig. 1b. Data represent mean ± s.d.;
n = 3 biological replicates (b-d); *P < 0.05; **P < 0.01, Student’s t-test. 
Extended Data Table 1 | P. aeruginosa PA14 transposon mutants screened for activation of CYP71A12pro:GUS

Names   IDs     Type*   Names   IDs     Type*   Names   IDs     Type*
aprA    865     71      toxA    399     2       cpB     130     6
aprD    7385    1       xcpP    3450    2       hcp1    4311    6
aprE    1317    1       xcpQ    417     2       hcpA    4107    6
aprF    922     1       xcpR    812     2       sinR    1865    6
aprI    4760    1       xcpT    4498    2       stp1    3334    6
aprX    1421    1       xcpW    3292    2       vgrG2   141     6
hasAp   3774    1       xcpZ    4249    2       gacA    631     R
hasF    1253    1       xphA    4239    2       lasI    3828    R
cbpD    1394    2       xqhA    246     2       rhlI    3829    R
cupB5   75      2       exoT    7001    3       rhlR    3229    R
lasA    1299    2       exoU    339     3       rpoS    2108    R
lasB    759     2       exoY    7430    3       fimL    627     S
lipA    2386    2       pscD    1303    3       flgB    4759    S
lipC    2417    2       aaaA    375     5       flgK    352     S
pepB    629     2       eprS    69      5       fliC    1029    S
phoA    914     2       estA    354     5       fliN    4482    S
phoB    3473    2       lepA    631     5       motA    2879    S
phoR    1112    2       tps?    556     5       motB    1935    S
plcB    1956    2       tps2    600     5       pilA    7353S
plcH    210     2       tps3    5555            pilD    2579    S
plcN    267     2       tps5    5450S           tadZ    1689    S
pmpA    93      2

* Numbers represent the type of secretion system. For example, ‘2’ means type II secreted protein or type II secretion machinery protein. R, regulatory proteins; S, surface proteins.
YAP is essential for tissue tension to ensure vertebrate 3D body shape

Vertebrates have a unique 3D body shape in which correct tissue and
organ shape and alignment are essential for function. For example, 
vision requires the lens to be centred in the eye cup which must in 
turn be correctly positioned in the head’. Tissue morphogenesis 
depends on force generation, force transmission through the tissue, 
and response of tissues and extracellular matrix to force*’. Although 
a century ago D’Arcy Thompson postulated that terrestrial animal 
body shapes are conditioned by gravity*, there has been no animal 
model directly demonstrating how the aforementioned mechano- 
morphogenetic processes are coordinated to generate a body shape 
that withstands gravity. Here we report a unique medaka fish (Oryzias 
latipes) mutant, hirame (hir), which is sensitive to deformation by 
gravity. hir embryos display a markedly flattened body caused by 
mutation of YAP, a nuclear executor of Hippo signalling that regu- 
lates organ size. We show that actomyosin-mediated tissue tension 
is reduced in hir embryos, leading to tissue flattening and tissue
misalignment, both of which contribute to body flattening. By analysing
YAP function in 3D spheroids of human cells, we identify the
Rho GTPase activating protein ARHGAP18 as an effector of YAP
in controlling tissue tension. Together, these findings reveal a previously
unrecognised function of YAP in regulating tissue shape and
alignment required for proper 3D body shape. Understanding this
morphogenetic function of YAP could facilitate the use of embryonic
stem cells to generate complex organs requiring correct alignment
of multiple tissues.

Figure 1 | Organ/tissue collapse and misalignment in hir mutants.
a1, a1’, Lateral view of live WT and hir mutant embryos, anterior to the left.
Arrowheads, heart. Brackets, embryo thickness. a2, a2’, Dorsal view, anterior
upwards. Arrowheads, mislocated lenses. a3, a3’, Transverse section at the
plane shown in a1 and a1’. Neural tubes (black dots) and somites (red dots).
b1-b3, Lateral and b1’-b3’, dorsal views of live embryos. Arrowheads,
blastoderm margin. c, Quantification of epiboly (%). Error bars, ± s.e.m.
(***P < 0.001; one-way ANOVA with Dunnett’s T3 post hoc. Fig. 1 Source
Data). d, Transverse sections at 5th somite level, neural tube (encircled) and
somites (blue) by myoD in situ hybridization. e, Time-lapse sequence of dorsal
view of WT and hir mutant right eyes. Arrowheads, lens placode; arrows,
invaginating retina. Fragmented and detaching lens placode demarcated by
dotted lines in e1’ and e2’. Scale bars, 40 μm.

Via exhaustive mutant screening in medaka and zebrafish**, we
identified medaka hir mutants displaying pronounced body flattening
around stage (st.) 25-28 (50-64 h post fertilization, hpf; Fig. 1a). Although
general development was not delayed, hir mutants exhibited delayed
blastopore closure (Fig. 1b, c) and progressive body collapse from
mid-neurulation (st. 20, 31 hpf) (Fig. 1d), surviving until just before
hatching (6 days post-fertilization, dpf). During body collapse, tissues
Figure 2 | Tissue tension is reduced in hir mutants. a, Embryos kept left side
down (a1, a1’), dorsal facing down (a2, a2’) and right side down (a3, a3’) from
st. 17-25, stained with phalloidin (green, F-actin) and TO-PRO-3 (blue,
nucleus). Large black arrow, direction of gravity; θ, angle that the tangent along
the brain ventricle (dotted lines in a1, a1’) makes with horizontal solid line.
b, Range of collapse of mutant and WT embryos kept sideways. Error
bars, ± s.e.m. **P < 0.01, t-test (see Fig. 2 Source Data). c, Immunoblotting of
phospho-myosin regulatory light chain (pMRLC, Ser19) and control
(GAPDH) (Supplementary Fig. 1). d, Actomyosin-labelled Tg(actb2:myl12.1-eGFP)
zebrafish embryos at 75% epiboly. Arrowhead, YSL actomyosin ring
(AR) at the margin of the EVL. Bracket, for analysis of EVL shape anisotropy
(Extended Data Fig. 3a). e, The actomyosin ring was cut along a 20-μm-long line
(red) perpendicular to the EVL/YSL boundary in MO-injected embryos,
when control MO-injected embryos were at 70-80% epiboly. f, Particle image
velocimetry (PIV) quantifies the velocity field (yellow arrows) of the recoiling
actomyosin network. g, Averaged temporal recoil velocity curves, control
MO (n = 41) and YAP;TAZ KD conditions (n = 50). Error bars, error of the
mean at 95% confidence. Exponential fit function with a linear offset (black
solid line) yields the characteristic decay time (inset) and h, the initial recoil
velocity for the control MO (23.8 ± 2.3 μm min⁻¹) and YAP;TAZ KD
conditions (11.2 ± 0.8 μm min⁻¹). Error bars, 95% confidence interval for
the fit results. i, Snapshot at the end of aspiration (600 s) of st. 22 neural tube
with constant pressure (ΔP = 4.5 mbar). j, The curves of the tongue length over
time to measure the aspiration of WT, hir mutant and ROCK inhibitor
(Y27632)-treated neural tube explants. Error bars, ± s.d. Maximum tongue
lengths measured at 600 s were compared by t-test in k. Box plots represent 5%,
25%, median, 75%, and 95%. *P < 0.05, ***P < 0.001. Scale bars, 40 μm
in a, i, 10 μm in e.


and organs including neural tube and somites gradually became flattened
and improperly aligned (Fig. 1d). Lenses were misaligned outside
the eyes (Fig. 1a2, a2’). Mutant lens placodes expressing sox3 formed
normally adjacent to the retina up to st. 20, but then became fragmented
and detached from the retina (Fig. 1e1’, e2’, Extended Data Fig. 1a, b
and Supplementary Videos 1, 2). These fragments gradually rounded
up with some re-attaching to the retina to form ectopic lenses that were
not incorporated (Fig. 1e). Thus, tissue flattening and misalignment
defects are associated with the flattened mutant phenotype. 
Positional cloning identified a nonsense mutation of Leu164 (TTG
to TAG) in the WW1 domain of YAP in hir (Extended Data Fig. 1c, d). 
YAP is the nuclear executor of the Hippo pathway and regulates organ 
growth via stimulation of cell proliferation’. In wild-type (WT) embryos, 
YAP transcripts are ubiquitous throughout development’®. Medaka 
maternal YAP messenger RNA (mRNA) was present at st. 10 in hir 
before onset of zygotic gene expression, but undetectable after st. 18 
(Extended Data Fig. 1e). Morpholino oligonucleotide (MO) YAP
knockdown (KD) in WT embryos recapitulated the hir phenotype 
(Extended Data Fig. 2a—c, Supplementary Tables 1, 2), and ubiquitous 
recombinant YAP mRNA expression rescued the hir phenotype (Ex- 
tended Data Fig. 1f). In addition, perturbation of maternal YAP mRNA 
translation in hir mutant embryos by YAP translation-blocking (TB) 
MO (mYAP KD hir embryos) elicited a more severe blastopore clo- 
sure and body flattening phenotype than in hir zygotic YAP mutants 
(Fig. 1b3, b3’, c, Supplementary Table 2). Blastopore closure defects, but 
not flattening, have been reported in YAP KD zebrafish and Xenopus". 
Since TAZ is a functional paralogue of YAP”, we evaluated its con- 
tribution to the YAP KD phenotype in zebrafish. YAP;TAZ double 
KD zebrafish embryos exhibited more pronounced blastopore closure 
defects than YAP KD alone (Extended Data Fig. 2d-h). YAP-4SA, which 
lacks four serines and predominantly localizes to the nucleus", rescued 
the hir phenotype more efficiently than WT YAP (Extended Data Fig. 1), 
suggesting that the hir phenotype depends on nuclear YAP. The main 
nuclear function of YAP is to promote proliferation and inhibit cell 
death"*. hir embryos had increased cell death from st.22 to 26 after 
body flattening had initiated (increased cell death per se does not lead 
to body flattening**). Cell proliferation remained close to normal in hir 
embryos but was strongly suppressed in TAZ KD (and YAP/TAZ double 
KD) medaka embryos (Extended Data Fig. 2i, j). Thus, in medaka, cell 


proliferation is mainly regulated by TAZ, while YAP is predominantly 
required for 3D body shape. 

Three dpf hir mutants showed different orientations of body flattening.
We therefore examined whether collapse correlated with the direction
of gravity. Mutant embryos maintained either right-side or left-side
down relative to the earth collapsed towards the earth as indicated
by the ventricle tangent (Fig. 2a). Average collapse angle, θ, in mutant
embryos was 17.3 ± 10.7° (n = 14; Fig. 2b) compared to 5.6 ± 3.3°
(n = 26, P < 0.01) in WT. Mutant embryos maintained dorsal side down
exhibited apparently uniform dorso-ventral compression (Fig. 2a2, a2’). 
Thus, flattening in hir embryos reflects an inability to withstand external 
forces (that is, gravity), suggesting reduced tissue tension. 
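
The comparison of collapse angles can be roughly checked from the summary values quoted above. The sketch below makes two assumptions that the text does not confirm: the ± values are treated as standard deviations, and Welch's unequal-variance form of the t-test is used (the source says only "t-test"). The function name is hypothetical.

```python
import math


def welch_t(mean1, sd1, n1, mean2, sd2, n2):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    v1 = sd1 ** 2 / n1  # squared standard error of group 1
    v2 = sd2 ** 2 / n2  # squared standard error of group 2
    t = (mean1 - mean2) / math.sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, df


# Collapse angle in degrees: hir mutants (n = 14) versus WT (n = 26),
# with the quoted ± values assumed to be standard deviations.
t_stat, dof = welch_t(17.3, 10.7, 14, 5.6, 3.3, 26)
print(f"t = {t_stat:.2f}, df = {dof:.1f}")
```

Under these assumptions the statistic is about 4 on roughly 14 degrees of freedom, which is in line with the reported P < 0.01.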

Tissue tension is generated primarily by actomyosin contraction”. 
During WT organogenesis, global levels of phosphorylated myosin 
regulatory light chain (pMRLC), indicative of actomyosin activity, 
increased (Fig. 2c), while in hir mutants they began decreasing as the 
blastopore closes (st. 17, 25 hpf), and continued decreasing coincident 
with tissue collapse and body flattening. To assess tissue tension during 
blastopore closure, we analysed a surface epithelial cell layer, the enveloping
layer (EVL)'® (Extended Data Fig. 3a1). Comparison of EVL
shape anisotropy between WT and hir embryos suggested that tissue 
tension in hir is reduced within the EVL (Extended Data Fig. 3a, b). We 
also quantified actomyosin network tension within the yolk syncytial layer 
(YSL) of zebrafish embryos with compromised YAP function expres- 
sing enhanced green fluorescent protein (EGFP)-myosin light chain 
protein, Tg(actb2:myl12.1-eGFP)'’. The YSL actomyosin network 
close to the EVL margin (Fig. 2d) was cut along a 20-μm-long line
perpendicular to the margin to reveal circumferential tension (Fig. 2e).
Recoil velocities were significantly reduced in YAP;TAZ KD (n = 50)
compared to control KD embryos (n = 41; 11.2 ± 0.8 μm min⁻¹ vs
23.8 ± 2.3 μm min⁻¹) (Fig. 2f-h), suggesting reduced actomyosin network
tension. Consistent with this, epiboly movements in YAP;TAZ
double KD zebrafish embryos were significantly reduced (KD embryos:
53.63 ± 3.93%; control embryos: 70.0 ± 2.18% deep cell epiboly). To
test whether reduced actomyosin network tension is also responsible 
for neural tube tissue flattening in hir, we performed micropipette aspi- 
ration experiments'®. hir neural explants were significantly less resistant 
Figure 3 | Cell and tissue dynamics in hir mutants. a, Schematic: hir
neural tube collapse is associated with long chain-like arrangements of
neuroepithelial cells generated by increased cell slippage and randomized
oriented cell division (Extended Data Figs 4, 5). b, Whole-mount FN
immunohistochemistry (IHC) of st. 22 embryos, dorsal view, anterior to the
top. b1-b1’’, WT embryos injected with out-of-frame 70 kDa N-terminal
medaka FN1a+1b mRNA (250 pg) (n = 20); b2-b2’’, uninjected hir mutants
than WT to external forces applied by aspiration, indicating reduced
neural tube tissue tension. The higher deformability of hir neural tube 
tissue was paralleled when myosin activity was reduced by ROCK inhi- 
bition (Fig. 2i-k). Together, these analyses indicate that YAP is required 
for actomyosin-mediated tissue tension in medaka and zebrafish. 
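
In laser-cutting measurements of this kind, the initial recoil velocity is commonly read as a proxy for the tension the network carried before the cut, provided viscous drag is comparable between conditions. Under that assumption, the ratio of the two reported velocities gives a rough estimate of the relative tension. The sketch below also assumes that the quoted uncertainties are independent and can be propagated to first order; the function name is hypothetical.

```python
import math


def ratio_with_error(a, da, b, db):
    """Ratio a/b with first-order propagated uncertainty (independent errors)."""
    ratio = a / b
    relative_error = math.sqrt((da / a) ** 2 + (db / b) ** 2)
    return ratio, ratio * relative_error


# Initial recoil velocities in μm per min: YAP;TAZ KD versus control MO.
ratio, error = ratio_with_error(11.2, 0.8, 23.8, 2.3)
print(f"KD/control recoil velocity = {ratio:.2f} ± {error:.2f}")
```

On these numbers the knockdown network recoils at a little under half the control velocity.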
Single-cell tracking analysis of the growing neural tube in hir showed 
that tissue flattening was associated with failure to stack cells, and increase 
in cells slipping to one side after perpendicular cell division (Fig. 3a, 
Extended Data Figs 4, 5). Live imaging showed loss of filopodia that 
tether lens to retina during lens invagination’ (Extended Data Figs 1b, 
6a, b). The formation of lens-retina filopodia requires fibronectin (FN)- 
integrin signalling and contractile actomyosin’. While st. 22 WT embryos 
had elongated thin FN fibrils between invaginating lens and retina, hir 
retina showed punctate FN patches (Fig. 3b1’’, b2’’), suggesting defective
FN fibril formation. In addition, large ectopic FN deposits were
found on the retina in hir (Fig. 3b2’). Similar loss of normal FN fibrils 
and formation of large FN deposits were observed throughout hir embryos 
(Fig. 3b4’, b5’). Furthermore, integrin β1 accumulation between lens
and retina was lost in hir (Extended Data Fig. 6c). In contrast, cell-cell 
adhesion and apical markers, including pan-cadherin, atypical PKC 
(aPKC) and ZO-1, were unaltered in hir (data not shown). Mosaic 
expression of YAP in hir and transplantation experiments both showed 
that the hir mutation acts in a non-cell-autonomous manner (Extended 
Data Fig. 7, Supplementary Table 4). For instance, in invaginated hir 
lens rescued by mosaic expression of YAP, non-YAP expressing hir 
cells recovered filopodia (Extended Data Figs 7b, 6b). These data sug- 
gest that YAP functions in tissue alignment by regulating FN assembly. 
To identify downstream YAP effectors regulating tissue tension, we 
used a human 3D spheroid in vitro culture system employing the human 
retina pigmented epithelial cell line hTERT-RPE1 (RPE1), which dis- 
played a relatively mild proliferation defect upon YAP KD. YAP KD 
spheroids collapsed upon exposure to external forces by slow centrifu- 
gation, unlike normal spheroids (Fig. 4a, b). pMRLC levels were re- 
duced in YAP KD spheroids (Fig. 4c), as in hir, suggesting that YAP 
maintains tissue tension also in human 3D tissues. YAP KD spheroids 
also lacked the typical beehive-like pattern of FN fibrils and, instead, 
contained large FN deposits, reminiscent of the hir retina phenotype 


(n = 11); b3-b3’’, WT embryos injected with N-terminal 70 kDa FN1a+1b
mRNA (250 pg) (n = 39). b1-b3, Left anterior head of live embryos (asterisks,
lens; triangle, forebrain ventricle); b1’-b3’, left eye of FN IHC (green), boxed
area magnified in b1’’-b3’’; b4, b5, surface view of FN stained neural tube,
WT (n = 15) and hir (n = 14) corresponding to the region in 1 and 2,
respectively, boxed area magnified in b4’ and b5’. Arrowheads, FN fibrils/puncta;
arrows, FN large deposits. Scale bars, 40 μm in b1, b1’, b4; 5 μm in b1’’, b4’.


14 MAY 2015 | VOL 521 | NATURE | 219 




[Figure 4: image panels a–g. Recoverable panel labels: Cont siRNA, YAP siRNA, ARHGAP18 siRNA #1 and ARHGAP18 KD; immunoblot rows YAP, pMRLC, ARHGAP18 and GAPDH (kDa markers); stains F-actin, FN and F-actin+FN; schematic labels WT, YAP KD, cortical F-actin, basal bundle, aberrant F-actin aggregates, normal FN fibrils, aberrant FN fibrils, adopting 3D tissue shape, tissue alignment, ARHGAP18, cortical actomyosin network, tissue tension, fibronectin assembly, integrin, filopodia, 3D body shape, and lens invaginating into eye cup.]


(Fig. 4d). Cortical actomyosin contraction is required for polymerizing 
FN monomers to form fibrils'”; consistently, FN fibril formation on the 
basal surface of control spheroids coincided with cortical F-actin bundles 
(Fig. 4d). In contrast, loss of normal FN fibrils in YAP KD spheroids was 
associated with marked reduction of cortical F-actin bundles (Fig. 4d, f). 
Instead, we observed F-actin aggregates, some of which were associated 
with large FN deposits, suggesting that they have increased local tension 
(Fig. 4d). A similar distribution of F-actin and FN was observed in hir 
(Extended Data Fig. 8a). Gene expression profiling of YAP KD spher- 
oids identified only forty genes with reduced expression (see Methods), 
including ARHGAP18, encoding a Rho GTPase activating protein that
Figure 4 | YAP regulation of tissue tension and FN assembly is mediated by
ARHGAP18. a, b, Confocal 3D sectioning of longest and shortest axes of
YAP and control (Cont) KD RPE1 spheroids (n = 5, 7) after centrifugation.
b, Ratio of longest (L)/shortest (S) axes. Error bars, ± s.e.m. *P < 0.05, t-test
(Fig. 4 Source Data). c, Immunoblotting of YAP and ARHGAP18 KD spheroids
for the indicated proteins (Supplementary Fig. 1). d, e, Whole-mount
imaging of basal surfaces of spheroids transfected with control siRNA (n = 17),
YAP siRNA (n = 13), and ARHGAP18 siRNA (n = 15), stained for F-actin
(red) and FN (green). d2–d4, Magnified view of boxed areas in d1. Arrowheads,
cortical regions; arrows, ectopic F-actin aggregates and aberrant FN fibrils.
f, Schematic; fine extracellular FN fibrils form in close proximity to cortical
F-actin in normal cells, while in YAP and ARHGAP18 KD cells, FN fibrils are
reduced and aberrant FN deposits coincide with ectopic F-actin aggregates.
g, Schematic summarizing how YAP/ARHGAP18-dependent actomyosin
network contraction controls tissue shape and alignment. Scale bars, 40 μm
in a; 30 μm in d1, e1; 15 μm in d2, e2.


suppresses F-actin polymerization by inhibiting Rho”. ARHGAP18 tran- 
scripts and protein levels were reduced in YAP KD spheroids (Fig. 4c), 
and ARHGAP18 KD spheroids exhibited a similar phenotype to YAP 
KD spheroids, including reduced pMRLC levels (Fig. 4c) and aberrant 
F-actin and FN assembly (Fig. 4e). This suggests that both disruption of 
cortical F-actin bundles and formation of ectopic F-actin aggregates 
(Fig. 4f) arise from F-actin over-polymerization in YAP KD spheroids 
(Extended Data Fig. 8b) and ARHGAP18 KD cells. Together, these re- 
sults suggest that ARHGAP18 acts downstream of YAP and is required 
for cortical actomyosin network formation and tissue tension. 

To analyse the contribution of actomyosin tension-mediated FN 
assembly defects to the hir eye phenotype, we blocked FN assembly to 
a similar extent to that in hir by overexpressing 70-kDa amino-terminal 
FN1a and FN1b fragments in WT embryos” (Fig. 3b3’, b3’’); this
caused near dislocation of the lens and fewer filopodia between lens 
and retina (Fig. 3b3). hir mutants had fewer filopodia than FN assembly 
blocked embryos (Extended Data Fig. 6a, b), suggesting that contractile 
actomyosin defects in hir exacerbate the incomplete lens dislocation 
caused by FN assembly defects. In contrast, FN assembly blocked embryos 
did not exhibit flattened tissues (Fig. 3b1-b3). Furthermore, the medaka 
FN1 mutant fukuwarai (fku) also exhibited lens dislocation but not 
tissue flattening (Extended Data Fig. 8c), suggesting that FN is specifi- 
cally required for tissue alignment, but not generally for YAP-dependent 
tissue shape. ARHGAP18 mRNA levels were significantly reduced in 
hir, and mRNA injection of plasma membrane-targeted myristoylated
ARHGAP18 (myrARHGAP18) into hir substantially rescued FN assembly 
defects, lens invagination and body flattening (Extended Data Fig. 9a, b). 
In contrast, inactivation of ARHGAP18 alone was insufficient to produce
a recognizable phenotype (data not shown), suggesting that multiple 
ARHGAP18-related genes function downstream of YAP in medaka 
embryos. Consistently, short interfering RNA (siRNA) knockdown 
screening in human cells identified five ARHGAP genes with similar 
functions to ARHGAP18, homologues of which are conserved in me- 
daka and zebrafish (Extended Data Fig. 9c, d and Supplementary Dis- 
cussion). These results suggest that ARHGAP18-related genes function 
as effectors of YAP essential for both tissue shape and FN-dependent 
tissue alignment. The hir phenotype is not simply due to reduced 
myosin contraction, because injecting mRNA of an activated form of 
MRLC-DD” did not rescue the hir phenotype (Extended Data 
Figs 3a6, b, 8d). Similarly, injection of dominant negative MRLC-AA” 
in WT embryos failed to fully phenocopy the hir tissue or body flattening 
phenotype (Extended Data Fig. 3a5, b). Collectively, these results suggest 
that YAP function in 3D tissue shape and FN assembly is conserved in 
human cells and is at least partly mediated by ARHGAP18-related 
genes. 

We propose that YAP is essential for tissue tension, acting through 
ARHGAP18 and related genes to regulate cortical actomyosin network 
formation (Fig. 4g). YAP-dependent actomyosin network tension is 
required for both proper tissue shape and alignment to ensure organ/body 
shape. Several upstream regulators of YAP-mediated cell proliferation 
have been identified, including cellular environment stiffness, suggesting
that YAP can function as a mechanosensor™. Our data show that
YAP also functions as a mechanoregulator of tissue tension. Reduced 
cortical actomyosin tension is the most probable cause of attenuated 
tissue tension in hir mutants. F-actin over-polymerization perturbs 
F-actin turnover required for actomyosin contraction in the cytokinetic 
ring”. Our finding that ARHGAP18, a suppressor of F-actin polymerization,
functions downstream of YAP further supports a critical role
of F-actin polymerization in contractile actomyosin network forma- 
tion. YAP is required for basal-level actomyosin activity, consistent with 
ubiquitous expression of actin modulator ARHGAP18”, additional to 
which spatiotemporal modulation of actomyosin activity defines tis- 
sue shape. Since ARHGAP18 suppresses actin polymerization, which 
in turn reduces nuclear localization of YAP”°, ARHGAP18 might sup- 
press YAP activity via a negative feedback mechanism. This points to a 
possible mechanical feedback loop where tissue tension controls YAP, 
and YAP in turn is required for tissue tension. 

Actomyosin contraction promotes FN assembly”. The tissue mis- 
alignment phenotype in hir is most likely owing to failure of YAP- 
dependent actomyosin contractility in controlling FN assembly. Since 
FN initiates extracellular matrix organization”, actomyosin contraction-mediated
FN assembly could be a critical in vivo mechanism that integrates
mechanical signals (for example, tension generated by actomyosin)
with biochemical signals (for example, integrin signalling). Notably, 
the phenotype of YAP KO mouse embryos resembles that of FN KO 
mouse embryos”, suggesting that YAP and FN have similar functions 
in mouse development. Interestingly, while YAP in medaka is predom- 
inantly required for tissue tension, its paralogue TAZ seems to be required 
for cell proliferation (Supplementary Discussion). Given the high degree 
of conservation of YAP and other Hippo pathway components across 
metazoa””’, it will be worth investigating whether the extent of tissue 
three-dimensionality and alignment correlate with the emergence of 
YAP-mediated resistance to gravity at the evolutionary transition from 
uni- to multicellular organisms. Finally, generation of a simple organ, 
such as an eye cup, from induced pluripotent/embryonic stem cells 
depends on tissue self-organization involving force-mediated pro- 
cesses for which the mechanism remains elusive”. Our finding that 
YAP-dependent force-mediated morphogenesis is required not only 
for 3D tissue morphogenesis but also tissue alignment suggests that 
YAP-dependent force-mediated morphogenesis could be involved in 
self-organization of multiple tissues. Hence, our findings could have 
implications for the generation of complex organs comprising multi- 
ple tissues from induced pluripotent/embryonic stem cells. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Fish maintenance and fish strains. Medaka (Oryzias latipes) and zebrafish (Danio 
rerio) strains were maintained and raised according to previously published pro- 
cedures*’. Medaka and zebrafish embryos were raised in E3 solution at 28 °C. Fish 
care and procedures were approved by the University of Bath Ethical Review Com- 
mittee, and are in compliance with the Animals Scientific Procedures Act 1986 of 
the UK. Medaka WT strains K-Cab, K-Kaga, and the mutant strain hir*?°, fku® 94, 
were used®. Zebrafish WT strain AB and Tg(actb2:myl12.1-eGFP)*, which allows
visualization of myosin, were used.

Embryological experiments. For fixation and live imaging, embryos were anaes- 
thetized with 0.01% tricaine. For live imaging, embryos were embedded in 0.8% 
low melting temperature agarose (Type IV-A, Sigma, USA) in 35 mm glass-base 
dishes (Iwaki, Japan) at 28 °C. Standard embryological procedures, including
dechorionation, fixation, in situ hybridization, immunohistochemistry, microinjection
and cell transplantation, were carried out according to previously published
procedures*'. Cells were transplanted to the region fated to become the eye and 
Cuvier’s duct according to our fate map”’. 

Positional cloning of hir. The hir mutation induced in the K-Cab strain was crossed 
with the polymorphic K-Kaga strain to carry out genetic mapping according to a 
previously published procedure**. To map the hir mutation on the chromosome, 
bulked segregant analysis was performed using M-markers** on DNA isolated 
from 48 homozygous mutant embryos and 48 WT siblings from F2 embryos of
mutant × K-Kaga mapping crosses.

Chromosome walking on chromosome 13 was performed using restriction frag- 
ment length polymorphism markers between K-Cab and K-Kaga strains to map to 
the two BAC clones. For fine mapping, 1,908 meioses were analysed, identifying 9
recombinants that mapped the hir mutation close to YAP. YAP complementary DNA was
amplified from hir mutants by RT-PCR and sequenced directly to identify the 
mutation. 
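
The fine-mapping numbers above can be turned into an approximate genetic distance. The following is a minimal sketch, not part of the original methods: the function name is illustrative, and it assumes the usual small-distance approximation that 1% recombination corresponds to roughly 1 cM.

```python
def recombination_fraction(recombinants: int, meioses: int) -> float:
    """Fraction of scored meioses that carry a recombination event."""
    if meioses <= 0:
        raise ValueError("meioses must be positive")
    return recombinants / meioses


# Values reported in the text: 9 recombinants in 1,908 meioses.
fraction = recombination_fraction(9, 1908)

# Assumption: for small distances, 1% recombination is roughly 1 cM.
distance_cm = fraction * 100

print(f"recombination fraction = {fraction:.5f}")
print(f"approximate distance   = {distance_cm:.2f} cM")
```

Under this approximation the interval between hir and the closest flanking recombinants is well below 1 cM, which is consistent with the statement that the mutation maps close to YAP.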

RT-PCR, cDNA cloning and construction. Total RNAs were isolated using TRIzol
(Life Technologies) and were converted to cDNA using the RNA-PCR kit ver.3 
(Takara Bio, Japan) followed by PCR using KOD plus polymerase (Toyobo, 
Japan). For mRNA production, PCR amplified full-length cDNAs (medaka YAP, 
70-kDa FN1a,b, ARHGAP18) were cloned into pCS2+ and for in situ hybridization
medaka sox3 cDNA was cloned into pBluescript II SK(−). pCS2+ myr-ARHGAP18
was constructed by adding the myristoylation sequence using oligonucleotides to 
produce myristoylated ARHGAP18 mRNA. mRNAs were synthesized using SP6 
mMESSAGE mMACHINE Kit (Ambion, USA). Primer sequences are shown in 
Supplementary Table 5. 

Gravity experiment. Dechorionated embryos were embedded in 0.8% low melt- 
ing temperature agarose in three orientations against gravity at st. 17, fixed at st. 25 
and subjected to cryosectioning to determine the direction of tissue/organ col- 
lapse. Collapse of embryos towards gravity was assessed using images of sections 
stained with TO-PRO-3 and phalloidin. 

Microinjection. mRNA, DNA and Morpholino were injected at 1-cell or 8-cell 
stages to deliver them to all cells or in a mosaic manner. The volume of a single
injection was 0.5 nl.

Phenotypic rescue experiments. Embryos from hir heterozygote crosses were
injected with mRNA of YAP variants. For transplantation phenotypic rescue exper- 
iments, embryos were genotyped by PCR using primers (Supplementary Table 5). 

Morpholino KD analysis in medaka and zebrafish. Morpholino oligonucleotides
(MOs) from Gene Tools (USA) were used (Supplementary Table 6). Specificity of
KD by MO was confirmed in a slightly different manner in medaka and zebrafish. 
Since rescue of the phenotype by mRNA injection did not work effectively in zebra- 
fish, three different types of MOs, translation blocking (TB), splicing blocking (SB) 
and 5’ UTR MOs, were used and all were confirmed to induce a similar phenotype. 
In medaka, TB and SB MOs were used, and the phenotype was rescued by co- 
injecting corresponding mRNAs. To determine efficiency of KD, semiquantitative 
RT-PCR was carried out using primers that distinguish defective splicing from 
normal forms of mRNA (Supplementary Table 5). 

Immunohistochemistry. Embryos were fixed in either 4% paraformaldehyde 
(PFA), Dent fixative or 1% TCA for 1-3 days and subjected to cryosectioning as de- 
scribed previously*’. Antibodies used were: anti-FN antibody (Ab), Sigma F3648 at 
1:100; β-integrin monoclonal Ab, 8c8 (Developmental Studies Hybridoma Bank,
USA) at 1:10; anti-aPKC C-20 (SC216, Santa Cruz Biotech, USA) at 1:100; anti- 
PCNA (PC10, Santa Cruz Biotech, USA) at 1:500; anti-laminin (Ab-1, NeoMarkers, 
USA) at 1:100 and anti-ZO-1°° (gift from M. Itoh) at 1:1. Sections were counter- 
stained with Alexa Fluor 488 or 546 Phalloidin (A12379, A22283, Invitrogen USA) 
at 1:250 and TO-PRO-3 (T3605, Invitrogen, USA) at 1:1,000. 

Time-lapse microscopy and image analysis. Time-lapse analysis of lens disloca- 
tion was carried out using a Leica MZ16FA dissecting microscope. Confocal micro- 
scopy used a Leica TCS SP5 and images were analysed by Imaris 7.3 (Bitplane, 
ANDOR Technology, UK) and Amira 5.1 (Visage Imaging, USA). Cell division
orientation (θ) of telophase cells in time-lapse sequences was determined by
drawing an axis from the ventricular zone-attached non-moving daughter cell
(asterisk in Extended Data Fig. 5c) towards the non-attached moving daughter cell’’.
The acute angle of this axis was then measured against the axis of the ventricular 
zone. Imaging was carried out dorsal side down using an inverted microscope. Rose 
diagrams were generated using Oriana v4 (Kovach Computing Services, UK). 
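
The orientation measurement described above can be sketched as a small geometric calculation. This is an illustrative example only: the function names and the 2D coordinates are hypothetical, and the original analysis was done on time-lapse images rather than with this code.

```python
import math


def acute_angle_deg(axis, reference):
    """Acute angle (0-90 degrees) between two 2D direction vectors."""
    ax, ay = axis
    rx, ry = reference
    norm = math.hypot(ax, ay) * math.hypot(rx, ry)
    if norm == 0:
        raise ValueError("vectors must be non-zero")
    # abs() folds the result into 0-90 degrees: an axis has no sign.
    cos_theta = min(1.0, abs(ax * rx + ay * ry) / norm)
    return math.degrees(math.acos(cos_theta))


def division_orientation(attached_cell, moving_cell, vz_start, vz_end):
    """Angle between the daughter-cell axis and the ventricular zone axis.

    The division axis runs from the attached (non-moving) daughter cell
    towards the non-attached (moving) daughter cell.
    """
    axis = (moving_cell[0] - attached_cell[0], moving_cell[1] - attached_cell[1])
    reference = (vz_end[0] - vz_start[0], vz_end[1] - vz_start[1])
    return acute_angle_deg(axis, reference)


# Hypothetical coordinates (arbitrary units): the moving daughter sits
# directly above the attached daughter, and the ventricular zone is horizontal.
theta = division_orientation((0.0, 0.0), (0.0, 5.0), (-10.0, 0.0), (10.0, 0.0))
print(f"theta = {theta:.1f} degrees")
```

Taking the absolute value of the dot product keeps every angle between 0 and 90 degrees, which matches the use of the acute angle in the text.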

Spheroid analysis. hTERT-RPE1 cells (American Type Culture Collection; CRL-
4000) were seeded (2 × 10° cells per well in 6-well plates). Each stealth RNA
(100 pmol) in Opti-MEM medium (Life Technologies) was transfected using
Lipofectamine RNAiMAX (Life Technologies) followed by incubation for 24 h at 37 °C.
Trypsin treatment was used to collect RNAi-transfected cells from wells, which
were resuspended in 2 ml of 10% FBS (Hyclone, ThermoFisher Scientific)–DMEM.
These resuspensions were seeded to 6 wells of a 12-well plate (Hydrocell, CellSeed
Japan) and incubated for 48 h at 37 °C. Spheroids were fixed in 3% formalin and
subjected to immunostaining. Reagents used for immunostaining: anti-β-catenin
(BD transduction, 610154, 1:200), anti-FN (Sigma F3648, 1:500), Alexa Fluor 546
Phalloidin (Invitrogen, A22283, 1:200). For the list of primers see Supplementary
Table 5. 

Western blotting. Spheroids were lysed in lysis buffer (0.5% TritonX-100, 150 mM 
NaCl, 20 mM Tris-HCl pH 7.5). The lysates were sheared with a 21-gauge needle, 
incubated on ice for 30 min and clarified by centrifugation at 20,817g for 15 min at
4°C. The extracted proteins were separated by SDS-PAGE and transferred to 
Immobilon transfer membrane (Millipore) for western blotting analyses. The primary
antibodies were anti-YAP1 pAb (4912 Cell Signaling, 1:500), anti-fibronectin
pAb (F3648, Sigma Aldrich, 1:1,000), anti-ARHGAP18 pAb (1:10,000)'”', anti-MYH9
pAb (3403 Cell Signaling, 1:1,000), anti-Phospho Ser1943-MYH9 pAb
(5026 Cell Signaling, 1:1,000), anti-MYH10 mAb (8824 Cell Signaling, 1:1,000),
anti-Phospho-Ser19 MLC2 (3675, Cell Signaling, 1:100), and anti-GAPDH mAb
(sc32233, Santa Cruz, 1:5,000). 

Actomyosin tension measurement by laser cutting. Laser cutting experiments 
were carried out using a UV-laser ablation system as previously described’’. 
Tg(actb2:myl12.1-eGFP)” embryos were mounted in 1% low melting point agarose
(Invitrogen) embedded in E3 medium inside a glass bottom Petri dish (Mattek). A
63× water immersion objective (NA = 1.2, Zeiss) was used to visualize the YSL
actomyosin ring at respective epiboly stages. Cuts were made at a distance of 20 μm
from the EVL/YSL boundary by applying 25 UV pulses at 1 kHz to 40 equidistant
sites along a 20-μm-long line perpendicular to the EVL margin as depicted in Fig. 2e.
Fluorescent images of embryos were captured using an iXon DU-897-BV camera
(Andor Technology) with a 380 ms exposure time and 500 ms frame interval (LabVIEW
v10.0.1). The ablation procedure itself took 1.2 s, during which no images were
acquired. Temperature was kept constant at 28.5 ± 1 °C throughout the experiment
by means of a custom-built temperature chamber and an objective heating ring. The
recoil velocity of the cortex in response to the cut opening was analysed using
customized Matlab (v7.12) scripts based on particle image velocimetry (PIV) as
previously described’’**. The component of the PIV flow field that is orthogonal
to the cut line was averaged in two adjacent rectangles (Fig. 2f) for time frames up
to 9 s post-ablation. The resulting recoil velocity curves for single embryo ablation
experiments were averaged to yield the mean temporal recoil velocity curve for the
depicted conditions (Fig. 2g). Laser ablation experiments that caused a wound response,
recognizable by a strong accumulation of myosin following the ablation, were discarded
from the analysis. In these experiments leakage of yolk cytoplasm through
a membrane opening may interfere with the cortical tension measurements”.
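
The averaging step at the end of this analysis can be illustrated with a short sketch. It is not the authors' Matlab code: the function name and the example velocities are hypothetical, the first frame is assumed to correspond to 0 s post-ablation, and only the 500 ms frame spacing and the 9 s window are taken from the text.

```python
from statistics import mean

FRAME_INTERVAL_S = 0.5  # 500 ms between frames, as stated in the text
MAX_TIME_S = 9.0        # curves are analysed up to 9 s post-ablation


def mean_recoil_curve(curves):
    """Average per-embryo recoil velocity curves frame by frame.

    `curves` is a list of lists; element i of each inner list is the recoil
    velocity (orthogonal to the cut line) at frame i after ablation.
    Assumption: frame 0 corresponds to 0 s post-ablation.
    """
    if not curves:
        raise ValueError("need at least one curve")
    max_frames = int(MAX_TIME_S / FRAME_INTERVAL_S) + 1  # frames at 0, 0.5, ..., 9 s
    n_frames = min(min(len(c) for c in curves), max_frames)
    times = [i * FRAME_INTERVAL_S for i in range(n_frames)]
    velocities = [mean(c[i] for c in curves) for i in range(n_frames)]
    return times, velocities


# Hypothetical recoil velocities (arbitrary units) for two embryos.
times, velocities = mean_recoil_curve([[4.0, 2.0, 1.0], [2.0, 1.0, 0.5]])
print(list(zip(times, velocities)))
```

Curves from ablations that triggered a wound response would be removed from the input list before averaging, following the exclusion rule described above.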

Micropipette aspiration analysis. The whole neural tube was dissected out from
st.22 medaka embryos and was cut using a tungsten needle at the level of the
diencephalon–midbrain boundary. The micropipette was connected to a Microfluidic Flow
Control System (Fluigent, Fluiwell), which was controlled via a custom-programmed
LabVIEW (National Instruments) interface. In the BSS medium, the neural tube was
aspirated from the open end by a micropipette (internal radius = 30–35 μm) at a
constant pressure (ΔP = 4.5 mbar) for 10 min. Aspiration was imaged at 500 ms
intervals by a Leica SP5 inverted confocal microscope using a Leica 20×, 0.7 NA
objective. Temperature in the dish was kept constant at 28 °C by a heated sample
holder. Measuring the tongue length of the tissue within the micropipette using
FIJI software over time yielded the characteristic tissue flow curves during aspiration
for WT and hir mutant neural tube explants. To reduce cortical tension, WT
neural tube explants were treated with ROCK inhibitor Y27632 (250 μM dissolved
in water) for 15 min before performing the micropipette aspiration experiment.

Oligo DNA microarray analysis. For the Oligo DNA microarray analysis, total
RNA samples were collected from hTERT-RPE1 multicellular spheroids. 3D-Gene 
Human Oligo chip 25k (TORAY) was used. Total RNA of YAP siRNA-transfected 
spheroids and that of negative control siRNA were labelled with Cy3 or Cy5 using 
the Amino Allyl MessageAMP II aRNA Amplification Kit (Life Technologies), 
respectively. The Cy3- or Cy5-labelled aRNA pools and hybridization buffer were 
mixed, and hybridized for 16 h at 37 °C. The hybridization was performed using
the supplier’s protocols (http://www.3d-gene.com). Hybridization signals were
scanned using a 3D-Gene Scanner 3000 (TORAY). Detected signals for each gene 
were normalized by a global normalization method (Cy3/Cy5 ratio median = 1). 
Genes with Cy3/Cy5 normalized ratios greater than 2.0 or less than 0.5 were defined, 
respectively, as commonly up- or downregulated genes. The results were deposited 
at GEO under the accession number GSE54146. 
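
The normalization and thresholding rule described above can be sketched as follows. This is an illustrative example with made-up gene names and ratios, not the vendor's software; it assumes, following the labelling order given in the text, that Cy3 corresponds to the YAP siRNA sample and Cy5 to the control.

```python
from statistics import median


def global_normalize(ratios):
    """Scale Cy3/Cy5 ratios so that their median equals 1."""
    m = median(ratios.values())
    if m <= 0:
        raise ValueError("median ratio must be positive")
    return {gene: r / m for gene, r in ratios.items()}


def classify(normalized, up=2.0, down=0.5):
    """Call genes up- or downregulated using the thresholds in the text."""
    calls = {}
    for gene, r in normalized.items():
        if r > up:
            calls[gene] = "up"
        elif r < down:
            calls[gene] = "down"
        else:
            calls[gene] = "unchanged"
    return calls


# Hypothetical raw Cy3/Cy5 ratios for five placeholder genes.
raw = {"geneA": 6.0, "geneB": 2.0, "geneC": 2.0, "geneD": 0.5, "geneE": 2.0}
normalized = global_normalize(raw)
calls = classify(normalized)
print(calls)
```

Dividing by the median removes a global dye or loading bias before the fixed twofold thresholds are applied, so a gene is called only if it changes relative to the bulk of the array.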

Quantitative RT-PCR analysis. Total RNA was isolated from WT and hir mutant 
embryos at various developmental stages using TRIzol (Invitrogen) according to 
the manufacturer’s instructions. First-strand cDNA was synthesized from 1 μg
total RNA using Superscript III reverse transcriptase (Invitrogen) with an oligo-dT
primer. Each quantitative real-time RT-PCR was performed using the CFX96
real-time PCR detection system (Bio-Rad). Primers used for RT-PCR analysis are
shown in Supplementary Table 5. For a 10 μl PCR, cDNA template was mixed with
the primers to final concentrations of 250 nM and 5 μl of SsoFast EvaGreen Supermix
(Bio-Rad), respectively. The reaction was first incubated at 95 °C for 3.5 min,
followed by 45 cycles at 95 °C for 30 s, 65 °C for 30 s and 72 °C for 30 s.

Phylogenetic analysis of ARHGAP18-related genes in 11 metazoan species.
Lists of homologues of ARHGAP18 family (TF314044) and its closely related families 
ARHGAP6 (TF316710) and ARHGAP11 (TF332212) in 11 metazoan model species 
were downloaded from the Treefam database. Amino-acid sequences for these genes 
were downloaded from Ensembl. Multiple sequence alignment was performed using 
the PRANK package. This alignment was used to infer the phylogenetic relationship
of these genes by maximum likelihood using FastTree 2.1.

ARHGAP siRNA screening in HeLa cell line. A library of siRNAs targeting human 
GAPs was obtained from Invitrogen. HeLa cells cultured in 24-well plates were 
transfected with siRNAs (20 nM) using Lipofectamine RNAiMAX. After 72 h, cells 
were fixed with 4% PFA and stained with FITC-labelled phalloidin (Invitrogen). 
Images were taken using an Olympus IX71 fluorescence microscope. 

Statistical analyses. Statistical significance between WT and mutant groups was 
tested using independent two-tailed t-tests (for two-way comparisons) and one-way 
ANOVAs (for multiple comparisons), with a Dunnett’s T3 post-hoc where neces- 
sary, in SPSS 20 (IBM) or Prism v5.0 (GraphPad). The Dunnett’s T3 post-hoc 
assumes variances to be unequal and allows comparisons of groups with different 
n numbers. To test for differences in mitotic orientation between WT and hir we 
performed the Kolmogorov–Smirnov (KS) test (http://www.physics.csbsju.edu/stats/KS-test.n.plot_form.html).
The KS test makes no assumptions about the distribution
of data being tested. Sample size was not pre-determined. We repeated
experiments a minimum of three times with sufficient n numbers for each repeat 
to be confident that reported results are representative. Randomization was not 
applied to allocate embryos to experimental groups. Blinding to group allocation 
was not used. Error bars on graphs show ± standard error of the mean (s.e.m.),
except when stated otherwise. Data points that deviated from the sample mean by
more than ± 3× the standard deviation were excluded from analysis.
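
Some of the quantities named in this section can be sketched in a few lines. The example below is illustrative only and does not reproduce the SPSS, Prism or web-tool calculations: the function names and data are hypothetical, the exclusion rule is interpreted as a single pass using the sample standard deviation, and only the KS statistic D (not its P value) is computed.

```python
import math
from bisect import bisect_right
from statistics import mean, stdev


def exclude_outliers(values, k=3.0):
    """Drop points more than k sample standard deviations from the sample mean.

    Assumption: one pass, with mean and s.d. computed on the full sample.
    """
    if len(values) < 2:
        return list(values)
    m, sd = mean(values), stdev(values)
    return [v for v in values if abs(v - m) <= k * sd]


def sem(values):
    """Standard error of the mean: sample s.d. divided by sqrt(n)."""
    return stdev(values) / math.sqrt(len(values))


def ks_statistic(a, b):
    """Two-sample KS statistic D: largest gap between the empirical CDFs."""
    a, b = sorted(a), sorted(b)
    d = 0.0
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        d = max(d, abs(cdf_a - cdf_b))
    return d


# Hypothetical measurements with one extreme value.
data = [10.0] * 10 + [11.0] * 10 + [100.0]
cleaned = exclude_outliers(data)
print(len(data), len(cleaned), round(mean(cleaned), 2), round(sem(cleaned), 3))
print(ks_statistic([1, 2, 3], [4, 5, 6]))
```

Note that a 3 s.d. rule can only exclude a point when the sample has at least 11 values, because the largest possible deviation in a sample of size n is (n − 1)/√n standard deviations.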

P values and sample sizes. P-values vs WT unless specified. Fig. 1c: n_cont = 39,
n_hir = 25 (P < 0.001), n_mYAPKDhir = 24 (P < 0.001); n_mYAPKDhir+YAPmRNA = 22
(P = 1.000). Fig. 2b: n_WT = 26, n_hir = 14 (P = 0.0001). Fig. 4b: n_ContsiRNA = 7,
n_YAPsiRNA = 5 (P = 0.023). Extended Data Fig. 2h: n_control = 20, n_ZFYAPTBMO = 11
(P < 0.001), n_ZFYAP+TAZTB = 10 (P < 0.001). Extended Data Fig. 2i: n_WT
st.20 = 5, st.22 = 8, st.24 = 13, st.26 = 6, st.28 = 10, n_hir st.20 = 8, st.22 = 4,
st.24 = 6, st.26 = 9 (P = 0.0284), st.28 = 5 (P = 0.0088), n_TAZMO st.20 = 5, st.22 = 5,
st.24 = 10, st.26 = 12, st.28 = 12. Extended Data Fig. 2j: n_WT st.20 = 11, st.22 = 7,
st.24 = 10, st.26 = 11, st.28 = 11, n_hir st.20 = 7, st.22 = 7, st.24 = 11, st.26 = 13
(P = 0.0158 vs TAZMO st.26), st.28 = 7 (P = 0.0075 vs TAZMO st.28), n_TAZMO
st.20 = 5, st.22 = 5, st.24 = 10 (P = 0.0007), st.26 = 8 (P = 0.0008), st.28 = 6 (P =
0.0120). Extended Data Fig. 3b: n_WT = 174, n_hir = 70 (P < 0.001), n_mYAPKDhir = 85
(P < 0.001), n_MRLCAA>WT = 135 (P < 0.001), n_MRLCDD>hir = 92 (P = 0.1830
vs hir). Extended Data Fig. 4b: n_WT = 3, n_hir = 3. Extended Data Fig. 5b: n_WT cell
stacking = 9, cell slippage = 8, parallel division = 5, n_hir cell stacking = 3
(P < 0.01), cell slippage = 21 (P < 0.05), parallel division = 5. Extended
Data Fig. 5d: KS-test, P = 0.01, n_WT st.22–24 = 32, st.25–26 = 13, n_hir st.22–24 = 14,
st.25–26 = 20. Extended Data Fig. 6b: n_WT = 10, n_70kDaFN>WT = 13
(P = 0.0032), n_hir = 6 (P = 0.0001), n_YAPS87A>hir = 10 (P = 0.0013 vs hir).
Extended Data Figure 1 | YAP is mutated in hir mutants. a, In situ
hybridization of sox3 showed that the lens placode (arrowhead) is specified in 
hir mutant embryos (n = 3) at st.21. At st.22, the nascent lens invaginated in 
WT (n = 21), but not in hir mutant embryos (n = 13, arrowhead). b, Two 
frames from time-lapse imaging of retina of embryos injected with membrane 
EGFP and nuclear red fluorescent protein (MNFP) mRNAs. In WT (n= 10), 
the nascent lens invaginates from st.21 (b1, margins of the lens indicated by 
arrowheads with retina to the right), whereas in hir (n = 7) the lens mostly 
detached from the retina (b2’, arrowheads show lens remnants attached to the 
retina). Scale bars, 80 μm in a; 30 μm in b. c, Nine recombinants in 1,908
meioses mapped hir close to the YAP gene on chromosome 13 (R: recombinant,
C: non-recombinant embryos). d, YAP cDNA encodes six protein binding
domains/motifs and one transcription activation (TAc) domain; a non-sense 
mutation in WW1 domain in hir. e, RT-PCR analysis of YAP mRNA during 
development. β-actin as control. f, mRNA of normal YAP and its variants
were injected into hir mutants. The numbers represent: hir phenotype rescue 
judged via brain thickness, heart migration and Cuvier’s duct formation; 
mutants (judged by genotyping when necessary); survived injected embryos 
of hir+/− crosses. High dose (400 pg) mRNA of YAP" variant was injected
into WT embryos to examine dominant-negative effects. The rescue by YAP*** 
variant required only 20% of the amount required to rescue using normal 
YAP mRNA. 
Extended Data Figure 2 | Morpholino knockdown in medaka and zebrafish.
a, Design of medaka YAP TB and SB MOs relative to translation start (ATG), 
exons (numbered boxes) and introns. Primers (arrows) used to assess the 
efficiency of SB MO KD. b, Upper panel, proper splicing of YAP transcripts 
(579 bp) was nearly fully blocked (343 bp, <5% of normal level) by YAP SB MO 
(5 ng), assessed by RT-PCR; lower panel, β-actin control. c, WT embryos
injected with YAP TB MO and standard control MO. c1-c3 Dorsal and 
c1’-c3’ lateral views (also Supplementary Table 1). Arrowheads indicate 
location of heart progenitors. Body flattening and bilateral cardiac progenitor
cell migration were affected in a dose-dependent manner. c2, c2’, Bilateral
cardiac progenitor cells fused at the midline but did not migrate anteriorly;
c1, c1’, their migration arrests next to the ears at the high dose. The two distinct
YAP morpholinos (YAP TB and SB MOs) mimicked the hir phenotype (hir
mutants have a cardia bifida phenotype (arrowheads in Fig. 1a1 and a1’)) in a
dose-dependent manner. To further verify specificity of the YAP MOs, YAP TB 
MO was co-injected with human YAP mRNA that does not hybridize with 
the YAP TB MO. Injection of YAP TB (but not YAP SB) MO into hir mutant 
embryos enhanced the blastopore closure phenotype of hir mutants (Fig. 1b, c, 
Supplementary Table 2). These maternal YAP KD hir mutant embryos failed to
close the blastopore. Less than half the amount (2 ng) of YAP TB MO was 
required for causing this phenotype in hir mutants compared to that required 
for WT embryos (5 ng). This blastopore closure phenotype was rescued by 
medaka YAP mRNA (200 pg) co-injection. d-g, Zebrafish (ZF) WT embryos 
injected with three distinct ZF YAP MOs (TB, 5’ UTR and SB) exhibit
the blastopore closure phenotype as in medaka (Supplementary Table 3).
Efficiencies of ZF YAP and TAZ SB MO KD (1.5 ng each) were assessed by RT-PCR
using primers in d, f, respectively, as in a, b. As reported by Gee et al., co-injection
of ZF YAP mRNAs did not rescue the ZF YAP MO phenotype in
zebrafish'’. h, Co-injection of ZF TAZ MO (total 2 ng) enhanced slow epiboly
of YAP TB KD-injected embryos; control = 89 ± 4.16% (n = 20), YAP
KD = 70.09 ± 4.7% (n = 11), YAP/TAZ KD = 52.5 ± 2.64% (n = 10). Error
bars show ± s.e.m. ***P < 0.001, one-way ANOVA. y axis shows percentage
epiboly. i, j, TUNEL for cell death and phosphohistone H3 (PH3) antibody
staining for cell proliferation (see Methods for sample sizes). Stained cells in the
neural tube were counted. Error bars indicate ± s.e.m. *P < 0.05, **P < 0.01,
***P < 0.001, one-way ANOVA.
Extended Data Figure 3 | Anisotropic enveloping layer cell shape analysis in
hir mutants. a1, Schematic of sectional view of blastoderm margin of a
gastrulating embryo (TJ, tight junction; AR, actomyosin ring; YSL, yolk
syncytial layer; EVL, enveloping layer); a2–a6, EVL shape was visualized in
phalloidin-stained fixed medaka embryos at 75% epiboly (st. 16, 21 hpf) and
compared among: 2, WT (n = 14); 3, hir (n = 9); 4, maternal YAP KD hir
mutants (mYAPKDhir) by TB MO-injection into hir embryos (n = 12); 5,
MRLC-AA (dominant negative form) mRNA-injected WT (n = 6); and 6,
MRLC-DD (constitutive active form) mRNA-injected hir embryos (n = 4).
b, EVL shape anisotropy quantification by the length/width ratio (LWR, shown
in a2) of marginal EVL cells (up to 4 rows back from the EVL/YSL boundary,
shown in Fig. 2d bracket). While EVL shape anisotropy was reduced in hir
mutant embryos (a3) to a level comparable to that of MRLC blocked embryos
(a5), activation of MRLC in hir (a6) did not rescue it. Parentheses indicate
number of cells measured. Scale bar, 30 μm. Error bars represent ± s.e.m.
***P < 0.001, one-way ANOVA.
Extended Data Figure 4 | Flattening of the hir neural tube is associated
with string-like cell arrangements. a, Increasing height (indicated by brackets
in a1 and a5) of WT neural tube (outlined, n = 10) was associated with cell
stacking. Time in minutes from st. 21 shown bottom left of each sub-panel. Red
fluorescent cells, for example, cell 1 in a1, labelled by photo-converting Kaede
fluorescent protein, rounded up at the ventricular zone (arrowhead in a2) and
divided along the ventricular zone (perpendicular cell division in a3) to
generate stacked daughter cells 1-1, 1-2, making the neural tube thicker in a5.
b, Width/height ratio of spinal cord, measured from time-lapse imaging of
single embryos (WT, hir n = 3 each), showed that flattening occurred
progressively in hir. Error bars are ± s.e.m. (see Source Data). c, Single-cell
tracking of clones (labelled by membrane-GFP and nuclear-RFP) of the
growing neural tube at the level of the fifth somite. Lower panels for WT and hir
show magnified views of shaded regions in upper panels. The flatter and
wider neural tube of the hir mutant at st. 27 was associated with long chain-like
cell arrangements (asterisks, bottom panels of hir) tracked from a single
neuroepithelial cell at st.22, compared with the thick cell group generated by
cell stacking in WT embryos. Scale bars, 40 μm.

Extended Data Figure 5 | Flattening of the hir neural tube is associated with
cell stacking failure. a-d, Single-cell analysis in hir neural tube shows cell
stacking failure occurred after mitosis (a, b) and during mitosis (c, d). Neural
progenitor cells divided with spindle orientation ‘perpendicular’ or ‘parallel’ to
the ventricular zone (‘perpendicular’ or ‘parallel’ cell division, respectively).
a, While daughter cells (asterisks) in WT remained stacked after 45 min
following perpendicular cell division (first row), those in hir exhibited cell
slippage (second and third rows). Telophase neuroepithelial cells in the neural
tube, first column; magnified views in second to fourth columns. Dotted
lines show division planes. Two types of cell slippage were observed: ventral
slippage (VS) where the dorsal daughter cell slipped towards the ventral
(second row), and dorsal slippage (DS) where the ventral daughter cell slipped
towards the dorsal (third row). After parallel cell division, daughter cells did not
change their positions in hir (fourth row). b, Cell stacking was reduced and cell
slippage increased after perpendicular cell division, but cells after parallel cell
division remained unaltered in hir mutants. Cell numbers in parentheses.
Error bars, ± s.e.m. *P < 0.05, **P < 0.01, t-test (see Source Data). c, During
perpendicular mitosis, daughter cells did not stack properly in hir mutants. Cell
division orientation (θ) was measured in time-lapse sequences as the acute
angle of the telophase cell axis against that of the ventricular zone (for example,
dotted line 26° in c). d, Rose diagrams showing frequency and angle of
parallel cell divisions. At st. 25-26 (50-54 hpf) perpendicular cell divisions
generated stacked cells against gravitational forces in WT (n = 3 embryos at
both stages). Far fewer stacked cells were observed in hir (n = 4 embryos at
st. 22-24, n = 3 embryos at st. 25-26). These results are illustrated in Fig. 3a.
Scale bars, 15 µm in a, 40 µm in c.

Extended Data Figure 6 | Detachment of lens is associated with loss of
filopodia in hir. a, Representative live images of filopodia (arrowheads) from
single lens cells (asterisks) expressing Lifeact-GFP in a mosaic manner;
a1, WT; a2, hir and a3, 70kDaFN mRNA-injected WT embryos at st. 21.5 when
lenses are detaching in hir mutants (see Extended Data Fig. 1b for larger
views). a3, Non-mosaic expression of 70kDaFN mRNA in WT embryos was
confirmed by co-injected H2A-red fluorescent protein (RFP) in the nucleus
(red). L, lens; R, retina. b, Filopodia number per cell was determined (see
Extended Data Fig. 7b4 for YAPS87A injected hir embryos). n, number of
analysed embryos. Error bars indicate ± s.e.m. **P < 0.01, ***P < 0.001,
one-way ANOVA (Extended Data Fig. 6 Source Data). c, Transverse section of
integrin-β1 IHC. Strong integrin-β1 localization between lens and retina in
st. 22 WT (n = 2) (c1, arrowhead); no such localization in hir (n = 3) (c2).
At st. 23 in hir (n = 3), weak localization where rounded up lens reattached to
retina (c2’, arrowhead). Scale bars, 10 µm in a; 40 µm in c.

Extended Data Figure 7 | The hir mutation acts cell non-autonomously.
a, Mosaic expression of EGFP-YAPS87A by mRNA injection at 16-cell stage in
hir mutant embryos rescued the hir eye phenotype in a2 compared to a1 (WT)
and a3 (hir). The boxed area in a2 is magnified in the lower panels (a2’-a2’’’):
fluorescence, merged and bright-field views, respectively. Arrowheads in a2’
indicate EGFP-YAPS87A-expressing clones. b, Non-cell autonomous rescue of
filopodia in hir mutant lens cells. YAPS87A + mCherry-CAAX (labels
membrane red) mRNA, and Lifeact-EGFP mRNA (labels F-actin green) were
injected into different cells at 8-16 cell stage. b1, In the invaginated (arrow) hir
mutant lens (boxed area magnified in b2 and b3, n = 10) rescued by mosaic
expression of YAPS87A (red), YAPS87A non-expressing mutant cells
recovered filopodia (arrowheads in b4, magnified view of b3). Filopodia
number/cell was compared between WT and hir in Extended Data Fig. 6b.
c1, Cells from donor embryos injected with rhodamine (red, top left) were
transplanted to a recipient embryo (top right, blastula stage st. 12) at the
location fated to be eyes (bottom, animal pole view). c2, WT; c3, hir and
c4, WT cells transplanted into hir mutant eye, causing the lens (arrowhead) to
invaginate into the retina as in WT at st. 23 (note that this confocal sectional
view represents a fraction of transplanted cells in the whole eye, see
Supplementary Table 4 for the frequency of rescue). Scale bars, 40 µm.

Extended Data Figure 8 | F-actin and FN localizations in hir. a, Whole-mount
imaging of WT (n = 5) and hir (n = 4) embryos stained for F-actin (red)
and FN (green). a1, a1’, Whole dorsal view of embryos anterior up, only FN
shown; a2-a4, a2’-a4’, magnified view of area indicated by asterisks in a1,
a1’; merged a2, a2’, F-actin a3, a3’ and FN a4, a4’. Arrowheads indicate
cortical F-actin and FN fibrils in WT and corresponding region in hir
(a3, a4, a3’, a4’); arrows show ectopic F-actin aggregates and aberrant FN
fibrils in a3’, a4’. b, Immunostaining of 2D cultured RPE1 cells transfected
with control (Cont, n = 21) and YAP siRNAs (n = 19) stained with
phalloidin (b1, b1’), β-catenin (b2, b2’) and merged with DAPI (b3, b3’);
phalloidin (b4, b4’), FN (b5, b5’) and merged with DAPI (b6, b6’). In
marked contrast to the 3D spheroids, FN deposits were not altered in YAP KD
cells (b5, b5’) despite increased F-actin stress fibres (b1, b1’ and b4, b4’).
c, The medaka fku mutants exhibit lens dislocation (arrows). Live dorsal view of
the head of c1, WT; c2, fku and c3, hir mutant embryos at st. 24. c4, The fku
mutation was mapped to LG21 to the region encompassing the FN1 gene
(0 recombinants/1,130 meiosis). Positional cloning identified a non-sense
mutation of Glu593 (GAA to TAA) in FN1 (2,503 amino acids). FN1
morpholino KD in WT embryos mimicked the fku mutant phenotype.
d, Constitutive-active MRLC-DD mRNA markedly increased body thickness of
WT embryos, but did not rescue the flattened body (brackets in lower panels)
and dislocated lens phenotypes of hir (n = 48). Upper panels, live lateral
view (insets, dorsal views of left eyes); lower panels, frontal sections stained with
phalloidin (red) and TO-PRO-3 (blue) at st. 25. Scale bars, 30 µm, except
a2, 15 µm and b1, 50 µm.

Extended Data Figure 9 | In vivo analysis of ARHGAP18 function.
a, Quantitative RT-PCR analysis showed that ARHGAP18 mRNA expression
in the hir mutant is significantly reduced to 76% of WT level. EF1α used as an
internal control. Data are shown as means ± s.e.m. (n = 10 each; *P < 0.001
Student’s t-test (two-tailed)). b, myrARHGAP18 mRNA (150 pg) injection
rescued the hir phenotype (21 rescued/39 hir/112 survived embryos). Upper
panels, live dorsal view; lower panels, frontal sections stained with phalloidin
(red) and TO-PRO-3 (blue) at st. 23; b1, uninjected hir, b2, injected hir and
b3, WT. The lens (asterisk) invaginated into retina (arrows, upper panel)
and the neural tube became thicker (brackets in lower panels) in the
myrARHGAP18 mRNA-injected hir mutant embryos. b2’, FN staining of
myrARHGAP18 mRNA-injected hir mutant embryos; boxed area magnified in
subsequent panel to the right; invaginated lenses had fine FN fibrils
(arrowheads) between lens and retina as in WT (see Fig. 3b1’’). c, Phylogenetic
analysis identified 16 ARHGAP18 paralogues in vertebrate lineages.
Arrowheads show medaka orthologues. d, siRNA screening of 40 human
ARHGAP genes in HeLa cells showed that KD of five ARHGAP genes
exhibited the rounding up phenotype similar to ARHGAP18 inactivation.
Scale bars, 30 µm.

Clinical improvement in psoriasis with specific
targeting of interleukin-23

Tamara Kopp, Elisabeth Riedl, Christine Bangert, Edward P. Bowman, Elli Greisenegger, Ann Horowitz, Harald Kittler,
Wendy M. Blumenschein, Terrill K. McClanahan, Thomas Marbury, Claus Zachariae, Danlin Xu, Xiaoli Shirley Hou,
Anish Mehta, Anthe S. Zandvliet, Diana Montgomery, Frank van Aarle & Sauzanne Khalilieh


Psoriasis is a chronic inflammatory skin disorder that affects 
approximately 2-3% of the population worldwide and has severe 
effects on patients’ physical and psychological well-being. The
discovery that psoriasis is an immune-mediated disease has led to 
more targeted, effective therapies; recent advances have focused on 
the interleukin (IL)-12/23p40 subunit shared by IL-12 and IL-23. 
Evidence suggests that specific inhibition of IL-23 would result 
in improvement in psoriasis. Here we evaluate tildrakizumab, a 
monoclonal antibody that targets the IL-23p19 subunit, in a 
three-part, randomized, placebo-controlled, sequential, rising 
multiple-dose phase I study in patients with moderate-to-severe 
psoriasis to provide clinical proof that specific targeting of IL-23p19 
results in symptomatic improvement of disease severity in human 
subjects. A 75% reduction in the psoriasis area and severity index 
(PASI) score (PASI75) was achieved by all subjects in parts 1 and 3 
(pooled) in the 3 and 10 mg kg⁻¹ groups by day 196. In part 2, 10 out
of 15 subjects in the 3 mg kg⁻¹ group and 13 out of 14 subjects in
the 10 mg kg⁻¹ group achieved a PASI75 by day 112. Tildrakizumab
demonstrated important clinical improvement in moderate-to-severe 
psoriasis patients as demonstrated by improvements in PASI scores 
and histological samples. 

Ustekinumab, a monoclonal antibody that binds the IL-12p40 sub- 
unit shared by IL-12 and IL-23 (IL-12/23p40), has demonstrated efficacy 
in the treatment of psoriasis. A previous study showed that ustekinumab
did not decrease IL-12p35 in human subjects, suggesting that
targeting IL-23 was more critical than IL-12 (ref. 6). The IL-23p19
subunit pairs with the IL-12/23p40 subunit to form the heterodimeric
cytokine IL-23. Both subunits are overexpressed in psoriasis plaques.
Human association data and pre-clinical studies strongly suggest that 
IL-23, not IL-12, drives psoriasis. 

We conducted this phase I study in subjects with moderate-to-severe 
plaque-type psoriasis to evaluate the clinical activity, safety and pharma- 
cokinetics of tildrakizumab (also known as MK-3222 or SCH 900222), 
a monoclonal antibody that specifically binds the IL-23p19 subunit of 
IL-23 to neutralize its function, but is unable to bind the IL-12 hetero- 
dimer of IL-12/23p40 and IL-12p35 (Extended Data Fig. 1). Evidence 
shows that IL-23 and its downstream effector cytokines (for example, 
IL-17A, IL-17F, IL-22 and TNF secreted by αβ T cells (T helper 17 (TH17),
TH22, T cytotoxic 17 (TC17), TC22), γδ T cells, regulatory T cells, natural
killer T cells, natural killer cells, type 3 innate lymphoid cells, neutrophils
and mast cells) are observed in psoriasis. Recent clinical data show
that antagonism of downstream cytokine IL-17A has a positive effect in
psoriasis, suggesting that an IL-23 antagonist would also be effective.

Baseline characteristics are shown in Extended Data Table 1. In total, 
65 of the 77 enrolled subjects completed the study; 3 were lost to follow 
up (that is, subjects did not return/could not be reached; 2 subjects
were on 0.1 mg kg⁻¹; 1 subject was on placebo); 2 discontinued owing
to adverse events (1 subject on 10 mg kg⁻¹ due to convulsions; 1 subject
on placebo due to arthralgia and worsening psoriasis); and 7 withdrew
consent (1 subject on 0.05 mg kg⁻¹; 2 subjects on 3 mg kg⁻¹; 4 subjects
on placebo). In part 1, subjects were randomized to intravenous injections
of placebo (n = 6) or 0.1 mg kg⁻¹ (n = 3), 0.5 mg kg⁻¹ (n = 3),
3 mg kg⁻¹ (n = 6) or 10 mg kg⁻¹ (n = 6) tildrakizumab on days 0, 56
and 84. In part 2, subjects were randomized to placebo (n = 11) or
3 mg kg⁻¹ (n = 15) or 10 mg kg⁻¹ (n = 14) tildrakizumab on days 1,
28 and 56. In part 3, subjects received placebo (n = 3) or 0.05 mg kg⁻¹
(n = 6) or 0.1 mg kg⁻¹ (n = 3) tildrakizumab on days 1, 56 and 84
(Extended Data Fig. 2).

In this study, disease activity measures were significantly improved 
after tildrakizumab. Tildrakizumab 0.05-10 mg kg⁻¹ resulted in a mean
placebo-corrected reduction in PASI score of 50-80% on day 112 with a
sustained response at day 196 (Fig. 1). In part 2, a clinically meaningful
response (mean placebo-corrected decrease in PASI score of 50%) was
still observed on study day 308 (36 weeks after the last administered
dose). All 3 and 10 mg kg⁻¹ subjects achieved a 75% reduction in the
PASI score (PASI75) in part 1 by day 196 and a majority achieved PASI75
in part 2 by day 112 (Extended Data Table 2). A large proportion of
3 and 10 mg kg⁻¹ subjects also achieved PASI90 response by day 112
(Extended Data Table 2). A representative subject’s clinical photographs 
demonstrate visual clearance of psoriatic lesions (Fig. 1). 
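
As a rough illustration of how the PASI75 and PASI90 endpoints are derived, the sketch below computes the percentage reduction from baseline and applies the response thresholds. The function names and the example subject are hypothetical; only the responder counts (10 of 15 and 13 of 14 in part 2) come from the text.

```python
def pasi_percent_reduction(baseline: float, current: float) -> float:
    """Percentage reduction in PASI score relative to baseline."""
    if baseline <= 0:
        raise ValueError("baseline PASI must be positive")
    return 100.0 * (baseline - current) / baseline


def achieved(baseline: float, current: float, threshold: float = 75.0) -> bool:
    """True if the reduction meets the threshold (75 for PASI75, 90 for PASI90)."""
    return pasi_percent_reduction(baseline, current) >= threshold


def responder_rate(responders: int, total: int) -> float:
    """Share of subjects (in per cent) who met the response threshold."""
    return 100.0 * responders / total


# Hypothetical subject: baseline PASI 20, follow-up PASI 4 (an 80% reduction).
print(pasi_percent_reduction(20.0, 4.0))        # 80.0
print(achieved(20.0, 4.0))                      # True  (PASI75 met)
print(achieved(20.0, 4.0, threshold=90.0))      # False (PASI90 not met)

# Part 2 PASI75 counts reported for day 112.
print(round(responder_rate(10, 15), 1))         # 66.7 (3 mg/kg)
print(round(responder_rate(13, 14), 1))         # 92.9 (10 mg/kg)
```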

Tildrakizumab was well tolerated up to the maximum dose evaluated 
(10 mg kg⁻¹ intravenously once monthly). The most common adverse
effects included headache, nasopharyngitis, upper respiratory infection 
and cough (Extended Data Table 3). There was no dose-related increase 
in adverse events or abnormal trends in clinical safety laboratories, vital 
signs and electrocardiograms (ECGs). There were 11 serious adverse 
events reported in 8 subjects (Extended Data Table 4). One serious 
adverse event (convulsions) was deemed to possibly be related to study 
medication due to the timing of the dose (17 days after dosing with 
10 mg kg⁻¹); several confounding factors in this subject include an
extended lack of sleep, alcohol consumption and use of benzodiazepine 
(diazepam), and the subject’s attempt to reduce the use of these agents. 
This phase I trial was limited in scope and sample size; further research 
will provide a full assessment of safety and tolerability of tildrakizumab. 
Previous research supports that IL-12 is important for intracellular pathogen
immunity and tumour surveillance; however, the IL-12/23p40
antagonist ustekinumab has a robust efficacy and safety database and
studies with large patient samples will be needed to differentiate selective 
IL-23 antagonists from broader IL-12/23p40 antagonists in terms of safety. 

Figure 1 | Tildrakizumab rapidly provides clinical benefit to moderate-to-severe
psoriasis patients. a, b, Mean percentage change in baseline PASI
scores in parts 1 and 3 (a) and part 2 (b) over time. The percentage change in
baseline PASI scores in placebo-treated patients from parts 1, 2 and 3 were
combined together and used as controls for both graphs. c, Representative
clinical photographs from a responder in the 10 mg kg⁻¹ tildrakizumab group
in part 2 (n = 14) demonstrate improvement in day 1 (D1) psoriatic lesions at
days 28, 112 and 140. Error bars denote s.e.m.

Tildrakizumab demonstrated slow systemic clearance and a long half-life
(~3 weeks). The mean t1/2 value ranged from 20.2 to 26.9 days (parts
1, 2 and 3). The mean clearance ranged from 1.57 to 2.50 ml day⁻¹ kg⁻¹
for 0.05 to 10 mg kg⁻¹. For these pharmacokinetic parameters, no dose
trends were observed. For part 2 only, an analysis of variance (ANOVA)
dose proportionality test showed that the area under the curve (AUC)
and maximum concentration (Cmax) were dose proportional for doses
of 3 and 10 mg kg⁻¹. Exposure was linear over the administered doses.
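
To give a feel for what a half-life of roughly 3 weeks implies, the sketch below converts the reported range of mean half-lives (20.2 to 26.9 days) into an elimination rate constant and the fraction of drug remaining after 28 days. It assumes simple first-order (mono-exponential) elimination, which is an assumption made here for illustration and not a model stated in the text; the function names are hypothetical.

```python
import math


def elimination_rate_constant(half_life_days: float) -> float:
    """First-order rate constant k (per day) from the half-life: k = ln 2 / t1/2."""
    return math.log(2) / half_life_days


def fraction_remaining(elapsed_days: float, half_life_days: float) -> float:
    """Fraction of drug left after elapsed_days, assuming first-order decay."""
    return 0.5 ** (elapsed_days / half_life_days)


# Reported range of mean half-lives; 28 days is an illustrative interval.
for t_half in (20.2, 26.9):
    k = elimination_rate_constant(t_half)
    left = fraction_remaining(28, t_half)
    print(f"t1/2 = {t_half} d: k = {k:.4f} per day, {left:.0%} left after 28 d")
```

Under this assumption, a substantial share of each dose would still be present four weeks later, which is in keeping with the slow clearance described above.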

Of the 56 tildrakizumab-treated subjects, 51 were pre-treatment neg- 
ative for anti-drug antibodies (ADA). Nine of these (18%) had at least 
one post-treatment ADA-positive sample and five of these nine showed 
lower tildrakizumab exposure than ADA-negative subjects. Subjects 
who had ADA-positive samples did not differ in their PASI response 
or adverse effect profile; further research will better assess whether the 
development of ADAs will affect safety or efficacy. Subjects who were 
ADA-positive before treatment may have had previous treatment with 
recombinant antibodies that generated a cross-reacting ADA response 
or there may have been matrix interference in the ADA assay. The 
screening ADA assay was designed to have a 5% false positive rate to 
increase assay sensitivity and therefore reduce the risk of missing subjects
who develop ADA.
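
The anti-drug antibody figures above can be checked with a few lines of arithmetic. The counts are taken from the text; the last quantity is only a rough expectation and assumes, for illustration, that the 5% screening false-positive rate applies independently to each subject's pre-treatment sample.

```python
treated = 56          # tildrakizumab-treated subjects
pre_negative = 51     # ADA-negative before treatment
post_positive = 9     # of those, at least one ADA-positive sample after treatment
lower_exposure = 5    # of the nine, with lower exposure than ADA-negative subjects

# Subjects who were already ADA-positive before treatment.
pre_positive = treated - pre_negative

# Treatment-emergent ADA incidence among pre-treatment negatives.
incidence_pct = 100 * post_positive / pre_negative

# Share of treatment-emergent ADA-positive subjects with lower exposure.
lower_exposure_pct = 100 * lower_exposure / post_positive

# Illustrative assumption: 5% of pre-treatment screens are false positives.
assumed_false_positive_rate = 0.05
expected_false_positives = assumed_false_positive_rate * treated

print(pre_positive)                        # 5
print(round(incidence_pct, 1))             # 17.6, reported as 18%
print(round(lower_exposure_pct, 1))        # 55.6
print(round(expected_false_positives, 1))  # 2.8
```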

We also conducted histological, immunohistochemical and gene
expression analyses of lesional skin after tildrakizumab treatment.
Psoriatic lesions are characterized by epidermal hyperplasia, hyper-/
parakeratosis, hypogranulosis, an increase in suprabasal mitotic keratinocytes,
dilation and increased growth of papillary blood vessels, and
leukocyte infiltration of both the dermis and epidermis. Both the 3 and
10 mg kg⁻¹ doses of tildrakizumab resolved thickened lesional epidermis
back to a non-lesional state, but could not entirely convert back
to the characteristics of normal skin (Fig. 2a). The amount of CD31⁺
vessels was increased in lesional skin when compared with non-lesional
skin and normal human skin; in contrast to the effects of tildrakizumab
on epidermal hyperplasia, only a slight trend to reduced
CD31⁺ vessel density was observed after treatment in lesional skin
(Fig. 2b). Blood vascular changes may be underestimated using anti-CD31
staining, due to CD31 expression by lymphatic vessels in the papillary
dermis.

The histopathological psoriasis severity score (HPSS) was developed 
based on a previous scoring system and was used to quantify the overall
global effect of tildrakizumab on the lesional skin architecture. The
total HPSS index was significantly reduced in all tildrakizumab groups,
with a mean reduction of 67% (95% confidence interval: 53-81%)
(Fig. 2c). In the 3 mg kg⁻¹ and 10 mg kg⁻¹ tildrakizumab groups, the
total HPSS was 16.5 ± 1.3 (mean ± s.e.m.) and 20.0 ± 2.5 before treatment
and 6.2 ± 1.9 and 4.4 ± 1.2 after treatment, respectively, and these
reductions were statistically significant (Fig. 2c). Although sample sizes
for individual groups were small (shown in Fig. 1), significant reductions
in epidermal (for example, hyperplasia and mitotic scores), vascular
and inflammatory cell infiltrate parameters were observed for the
3 mg kg⁻¹ and 10 mg kg⁻¹ tildrakizumab subgroups, but not for the
placebo group before and after treatment (Fig. 2d). Haematoxylin and 
eosin photomicrographs are shown in Fig. 3. 
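
The group means quoted above can be turned into percentage reductions with a short calculation, sketched below. The helper name is hypothetical. Note that a reduction computed from group means is not the same quantity as the mean of per-subject reductions, so these values are not expected to reproduce the 67% overall figure exactly.

```python
def percent_reduction(before: float, after: float) -> float:
    """Percentage reduction from the pre-treatment value."""
    return 100.0 * (before - after) / before


# Mean total HPSS before and after treatment, as reported in the text.
hpss_means = {
    "3 mg/kg": (16.5, 6.2),
    "10 mg/kg": (20.0, 4.4),
}

for dose, (before, after) in hpss_means.items():
    print(dose, round(percent_reduction(before, after), 1))
# 3 mg/kg 62.4
# 10 mg/kg 78.0
```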

IL-23 blockade had a significant effect on the immune compartment 
of psoriatic plaques. There was consistent clearance of most infiltrating 
leukocytic cell types in the lesional skin of tildrakizumab patients, sug- 
gesting that selective IL-23 blockade may induce disease remission. Con- 
sistent with a normalized skin environment, the proliferation marker 
Ki67 (Fig. 2e, f) and the abnormally expressed epithelial antigen keratin 
16 (Fig. 2g, h) were normalized after treatment. This correlates with a 
significant reduction in mitosis within the suprabasal epidermal layer 
(Fig. 2d). The inflammatory infiltrate in psoriatic lesions consists of 
CD3⁺, CD4⁺ and CD8⁺ T cells; CD68⁺ myeloid cells; BDCA-2⁺
plasmacytoid dendritic cells; CD11c⁺ myeloid dendritic cells; and
CD15⁺ neutrophils. Epidermal CD4⁺ and CD8⁺ T cells (Fig. 4a, b),
dermal myeloid dendritic cells (Fig. 4c), plasmacytoid dendritic cells
(Fig. 4d), and CD15⁺ neutrophils (Fig. 4f) were significantly decreased
following tildrakizumab dosing. Importantly, dermal CD4⁺ and CD8⁺
T cells (data not shown) demonstrated decreased levels. Langerin⁺ epidermal
Langerhans cell density was minimally altered in lesional psoriatic
skin and the density was not statistically affected by tildrakizumab
treatment (Fig. 4e). 

Because IL-23 is a master cytokine in the orchestration of cutaneous 
inflammation, we performed detailed analysis of lesional skin in
10 mg kg⁻¹ tildrakizumab and placebo patients to identify the cellular
source of IL-23p19 in the dermal compartment. Results demonstrate
CD11c⁺ myeloid dendritic cells, CD15⁺ neutrophils, and CD163⁺
macrophages to be the main producers of IL-23p19, with a negligible
number of CD117⁺ mast cells (Fig. 5). These findings are consistent
with previous reports showing that IL-23 is expressed by activated human
macrophages and dendritic cells; these data complement messenger
RNA data showing overexpression of IL-23p19 and IL-12/23p40 in
psoriatic skin.

Importantly, IL-23p19 expression trended to almost completely disappear
after tildrakizumab treatment (Fig. 5), although it was not significantly
different owing to high variability in infiltrating cell counts and
the small sample size. The reduction in IL-23p19⁺ cells after tildrakizumab
treatment has three potential explanations. First, tildrakizumab still
present in the biopsy specimen binds and blocks the IL-23-binding site
of the anti-IL-23p19 monoclonal antibody used for immunohistochemical
detection. Second, tildrakizumab exhibits a direct effect on
the IL-23-producing cells resulting in their deletion. Third, the decrease
in IL-23-producing cells is not an immediate direct downstream effect
of IL-23 blockade, but rather a general sign of the healing plaque. Importantly,
cross-blocking experiments demonstrated that tildrakizumab
does not interfere with the ability of the detecting anti-IL-23p19 antibody
to recognize and bind IL-23p19 (L. Bald, personal communication),
supporting one of the latter two explanations.

Gene expression analysis of lesional skin pre- and post-treatment was
performed to correlate histological and immunophenotypical changes
with tildrakizumab effects on skin gene expression. IL-19 and IL-20 are
keratinocyte growth/differentiation factors that were indirectly increased
in skin by IL-23 action in preclinical models. Tildrakizumab
decreased IL-19 and IL-20 message levels that correlated with decreased
epithelial thickness, decreased numbers of Ki67⁺ proliferating keratinocytes,
and normalized keratinocyte differentiation (that is, keratin 16).
The anti-microbial factors S100A7 and LCN2 were indirectly increased
in skin by IL-23 action in preclinical models and tildrakizumab also
normalized their expression. TH17 CD4 T cells express the chemokine
receptor CCR6, whose ligand CCL20 is overexpressed in lesional skin.
Neutrophils express CXCR1 and CXCR2, whose ligand CXCL8/IL-8 is
also overexpressed. Tildrakizumab decreased CCL20 and CXCL8/
IL-8 message levels, which correlated with reduced CD4 T-cell and
neutrophil infiltrates after tildrakizumab dosing (Fig. 6).

14 MAY 2015 | VOL 521 | NATURE | 223

©2015 Macmillan Publishers Limited. All rights reserved

[Figure 2 graphic. Plot key: placebo (Pbo) lesional (n = 5), 3 mg kg⁻¹ lesional (n = 10), 10 mg kg⁻¹ lesional (n = 7) and the corresponding non-lesional groups (one ‘before’ biopsy missing in the 10 mg kg⁻¹ non-lesional group); x axes: Before, After, Normal skin; panel d groups: Epithelial, Vascular, Infiltrate.]

Figure 2 | Tildrakizumab resolves altered epithelium in psoriasis patients.
a, b, Lesional (filled) and non-lesional (open) biopsies were taken before and
after three doses. Skin biopsies were taken from healthy volunteers (normal
skin). Epithelial thickness (a) or dermal CD31⁺ vessel number (b) was
measured in sectioned biopsies. c, d, HPSS scoring graded for overall epidermis,
vascular and infiltrate changes (c) and individual scores for each parameter (d).
e-h, Representative photomicrographs and analysis from lesional skin of a
responder in part 2 in the 10 mg kg⁻¹ tildrakizumab group (n = 7 for subjects
on 10 mg kg⁻¹ who provided skin samples) before (left) and after (right) dose
presented for anti-Ki67 (e, f) and anti-keratin 16 (g, h). Biopsies analysed
for placebo (pbo) (n = 5), 3 mg kg⁻¹ (n = 10) and 10 mg kg⁻¹ tildrakizumab
(n = 7), and healthy volunteers (n = 5). *P < 0.05, **P < 0.01 (Wilcoxon
matched-pairs signed-rank test). Data are mean ± s.e.m. Original
magnifications, ×100.


Figure 3 | Representative haematoxylin and eosin photomicrographs of
sections from lesional and non-lesional skin. a, b, Lesional skin before
(a) and after (b) treatment. c, d, Non-lesional skin before (c) and after
(d) treatment. Sections represent a subject who was a responder in part 2 in the
tildrakizumab dose group (n = 7). Original magnification, ×200.

Figure 4 | Tildrakizumab diminishes the elevated dermal infiltrate in
psoriatic skin. a-f, Lesional (filled) and non-lesional (open) biopsies were
taken before and after three tildrakizumab doses or placebo. Skin biopsies were
taken from healthy volunteers (normal skin). Biopsies were stained with
anti-CD4 (a), anti-CD8 (b), anti-CD11c (c), anti-BDCA-2 (d), anti-CD207/
Langerin (e) or anti-CD15 (f). Representative photomicrographs of lesional
[...] matched pairs signed-rank test); NS, not significant. Data are mean ± s.e.m.
Original magnifications, ×100.


Figure 5 | Identification of IL-23p19-producing cells in psoriatic lesions.
a, b, Four-millimetre lesional (filled circles) and non-lesional (open
circles) punch biopsies were taken before dosing and after a course of three
tildrakizumab doses or placebo. Skin biopsy sections were co-stained with
anti-IL-23p19 and anti-CD11c (a) or anti-IL-23p19 and anti-CD163 (b).
TissueQuest software was applied for image cytometry of immunofluorescence
stainings. Skin biopsies were analysed for placebo (n = 4) and 10 mg kg⁻¹
tildrakizumab (n = 4). Representative photomicrographs of lesional skin from a
responder in the 10 mg kg⁻¹ group in part 2 (n = 7 for subjects on
10 mg kg⁻¹ who provided skin samples). Data are mean ± s.e.m. Original
magnifications, ×400.

Figure 6 | Tildrakizumab inhibits IL-23-associated psoriatic skin gene
expression. Four-millimetre lesional (filled circles) and non-lesional (open circles)
punch biopsies were taken before dosing and after three tildrakizumab doses or
placebo. Lysates were prepared from biopsy sections for analysis on the NanoString
nCounter gene expression platform. Skin biopsies were analysed from patients on
placebo (n = 5), 3 mg kg⁻¹ tildrakizumab (n = 10) or 10 mg kg⁻¹ tildrakizumab
(n = 7). All genes were detectably increased in lesional skin compared with non-lesional
skin, and were significantly decreased by at least one tildrakizumab dose
using a P < 0.05 criterion in the Wilcoxon matched-pairs signed-rank test.
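
For readers unfamiliar with the Wilcoxon matched-pairs signed-rank test used for the before/after biopsy comparisons, the following is a minimal pure-Python sketch of an exact two-sided version. It is an illustration only: the text does not say which software was used or whether exact or approximate P values were computed, and the paired scores below are hypothetical.

```python
from itertools import product


def signed_rank_exact(before, after):
    """Exact two-sided Wilcoxon matched-pairs signed-rank test.

    Returns (W_plus, p_value). Zero differences are dropped and tied
    absolute differences receive average ranks. All 2**n sign patterns
    are enumerated, so this is only practical for small n.
    """
    diffs = [a - b for b, a in zip(before, after) if a != b]
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")

    # Rank the absolute differences, averaging ranks within ties.
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    start = 0
    while start < n:
        end = start
        while end + 1 < n and abs(diffs[order[end + 1]]) == abs(diffs[order[start]]):
            end += 1
        average_rank = (start + end) / 2 + 1  # ranks are 1-based
        for pos in range(start, end + 1):
            ranks[order[pos]] = average_rank
        start = end + 1

    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    centre = sum(ranks) / 2
    observed = abs(w_plus - centre)

    # Under the null hypothesis each rank is positive with probability 1/2.
    extreme = 0
    for signs in product((False, True), repeat=n):
        w = sum(r for r, positive in zip(ranks, signs) if positive)
        if abs(w - centre) >= observed:
            extreme += 1
    return w_plus, extreme / 2 ** n


# Hypothetical paired scores for seven subjects (not data from the study).
before = [22, 18, 30, 25, 19, 27, 21]
after = [9, 10, 8, 13, 12, 7, 15]
print(signed_rank_exact(before, after))  # (0, 0.015625)
```

Exhaustive enumeration is feasible here because the biopsy groups contain at most ten pairs, that is, no more than 1,024 sign patterns.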


Subsequent studies on tildrakizumab will address limitations in the 
study design. Although our data indicate dose-related efficacy, this trial 
did not have sufficient sample size to clearly show a dose response. A 
randomized, double-blind, placebo-controlled, parallel-group dose-range 
finding phase IIb trial will identify optimal doses (ClinicalTrials.gov 
(https://clinicaltrials.gov/) identifier NCT01225731). Skin biopsies were 
collected and processed for histological and immunohistochemistry 
analyses, but biopsies were not collected specifically for gene expression 
analysis. Dedicated biopsy collection for gene expression analysis will 
be performed in NCT01225731 to evaluate modulation of additional 
IL-23 pathway genes (for example, IL-17A, IL-17C, IL-17F and IL-22) 
that were below the limit of detection for technology used in this study 
to recover message from formalin-fixed tissue. Lastly, this trial was not 
designed to perform comparisons with other biologic treatments; an 
active comparator trial with a currently available treatment is
in progress (ClinicalTrials.gov identifier NCT01729754).

In summary, this study demonstrates that IL-23 is crucial in psori- 
asis pathogenesis and that selectively inhibiting IL-23 is a potentially 
important treatment modality. Tildrakizumab provided important clin- 
ical improvement and was found to be well tolerated in these patients 
with moderate to severe psoriasis. Further development of tildrakizu- 
mab is warranted based on these results to determine whether selective 
targeting of IL-23 can provide similar or better efficacy while reducing 
safety concerns that are associated with other biologic agents currently 
approved for the treatment of psoriasis. 

Online Content Methods, along with any additional Extended Data display items
and Source Data, are available in the online version of the paper; references unique
to these sections appear only in the online paper.

METHODS


This study (protocol 5382) was conducted from October 2008 to September 2011 
in nine study sites. The protocol was reviewed and approved by local Institutional 
Review Boards; all subjects signed written informed consent. Subjects also pro- 
vided consent to publication of clinical photographs. The study followed principles 
of Good Clinical Practice. Skin biopsies from healthy volunteers were obtained at 
the Vienna University Medical School. The investigational protocol was approved 
by the local Ethics committee of Vienna University’s Medical School, the Midlands 
Institutional Review Board (USA), and Ethical Committee for the Region of Capital 
(Denmark). All subjects agreed to study terms by giving their informed consent 
including their approval of documentation and publication of clinical pictures. 
Subjects. Subjects who were included in this study were aged 18-65 years and were 
to have a diagnosis of established and stable psoriasis vulgaris with an off-treatment, 
screening PASI score no more than 40% different from baseline PASI. Subjects were
to have moderate to severe psoriasis that was severe enough to render them eligible
for systemic therapy (that is, body surface area was ≥10% and PASI was ≥12 (if
on therapy body surface area was ≥10% and PASI was ≥10)). Target lesions were
on the head, trunk, arms or legs and were ≥10 cm² (~3 cm diameter). The total
numerical ratings for erythema, infiltration and desquamation were ≥6 out of a
possible 12 with 0 = none, 1 = mild, 2 = moderate, 3 = severe, 4 = very severe;
desquamation must have been ≥2.

All subjects were to be otherwise healthy: normal ECG, normal vital signs, no
history of latent or active tuberculosis; subjects were to be negative for tuberculosis
1 month before and had no recent contact with tuberculosis, no history of malig- 
nancy, no inflammatory bowel disease, ulcers, gastrointestinal bleeding, gastro- 
intestinal tract surgery, pancreatic injury, liver disease, impaired renal function, 
urinary obstruction, HIV, hepatitis C, hepatitis B, no infectious disease <4 weeks 
before study drug administration, no blood donation <60 days before study, no 
vaccine <1 month before the study, no immunosuppressive agent <4 weeks before 
study, no phototherapy <4 weeks before study, no topical treatments <1 week 
before study, and no monoclonal antibodies <3 months before study. 
Study design. This was a three-part, multiple-dose, randomized, placebo-controlled, 
patient- and evaluator-blind, multi-centre study of tildrakizumab in subjects with 
psoriasis (Extended Data Fig. 2). Part 1 was a safety, tolerability and pharmacoki- 
netic study that examined rising doses of tildrakizumab and was planned to enroll 
24 subjects with psoriasis into four ascending dose cohorts. Subjects were rando- 
mized within each cohort in a 3:1 ratio of active to placebo. Each subject received a 
single dose of 0.1 mg kg⁻¹ (cohort 1), 0.5 mg kg⁻¹ (cohort 2), 3 mg kg⁻¹ (cohort 3)
or 10 mg kg⁻¹ (cohort 4) of tildrakizumab, administered intravenously over 1 h, or
placebo. After an 8-week safety observation period, each subject in part 1 (cohorts 1
to 4) received an additional two doses of tildrakizumab intravenously at 0.1, 0.5, 3
or 10 mg kg⁻¹, respectively, or placebo, 4 weeks apart. Serum pharmacokinetic samples
were obtained pre-dose and at the time points specified in Evaluation Criteria.
Subjects were followed for 196 days after the first dose. 

Part 2 examined the efficacy of 3 and 10 mg kg⁻¹ tildrakizumab with a higher
number of subjects per group and was planned to enroll 40 subjects with psoriasis
into two ascending dose cohorts (cohorts 5 and 6). Subjects were randomized in a
3:1 ratio of active to placebo. Subjects in cohort 5 received three doses of 3 mg kg⁻¹
of tildrakizumab intravenously or placebo given at weeks 0, 4 and 8. Cohort 6
received three doses of 10 mg kg⁻¹ tildrakizumab intravenously or placebo given
at weeks 0, 4 and 8 (Extended Data Fig. 2). Serum pharmacokinetic samples were 
obtained pre-dose and at the intervals specified in Evaluation Criteria. Subjects 
were followed for 364 days after the first dose. Skin biopsies of lesional and non- 
lesional skin for histology and immunohistochemistry evaluation were obtained at 
baseline (before) and after a course of three doses of tildrakizumab or placebo at 
either day 63 or day 84 in a subgroup of patients at two clinical sites. 

Part 3 was added owing to efficacy observed at the low doses in part 1; we further 
examined efficacy using data from part 3. Part 3 of the study was planned to enroll 
12 subjects with psoriasis into cohort 7; these patients were randomized in a ratio 
of 2:1:1 to receive 0.05 or 0.1 mg kg⁻¹ tildrakizumab or placebo. All three treatments
were administered in a parallel fashion (that is, there was no sequential dose
escalation). Subjects received three doses of 0.05 or 0.1 mg kg⁻¹ of tildrakizumab
or placebo intravenously given at weeks 0, 8 and 12 (Extended Data Fig. 2). Serum 
pharmacokinetic samples were obtained pre-dose and at the intervals specified in 
Evaluation Criteria. Subjects were followed for 364 days after the first dose. Skin 
biopsies of lesional and non-lesional skin for histology and immunohistochem- 
istry evaluation were obtained at baseline and on day 84. 

PASI assessments were carried out at protocol-specified intervals until day 196 
in part 1 and day 364 in parts 2 and 3. 

Before administration of study medication, a computer-generated randomization 
number was assigned to all subjects who met the study’s entry criteria. The ran- 
domization number was sent to the study site by the sponsor and was then used to 
assign treatments to subjects. Blinding was conducted under third-party blinding
conditions in which only the site pharmacist, who was not involved with study 
procedures or evaluations, had knowledge of the treatment code. The pharmacist 
prepared the treatment, which was then dispensed in a blinded fashion by the inves- 
tigator. If blinding needed to be broken, the reason was to be recorded and the 
sponsor notified immediately. Blinded assessments were done by the sponsor and 
investigators on safety, pharmacodynamics, and pharmacokinetics. The rando- 
mization code was sent to a bioanalytical unit for analysis of samples. 
Objectives. The primary objectives were to determine the safety, tolerability and 
pharmacokinetics of rising intravenous doses of tildrakizumab in subjects with 
moderate-to-severe psoriasis (PASI ≥ 12). The secondary objective was to evaluate
the effect of tildrakizumab on skin disease in moderate-to-severe psoriasis; although 
change in PASI was a secondary end point, the study was powered to detect a 
placebo-corrected change in PASI of 30%. The primary end point was the placebo- 
corrected percentage change in PASI at day 112 (week 16). Additional endpoints 
included skin biopsies for exploratory immunohistochemistry, histology and gene 
expression. 

Timing of dose administration and evaluations. In parts 1 and 3, treatments 
were administered on study days 1, 56 and 84. Biopsies were collected at baseline 
(within 3-7 days of dosing on day 1) and on study day 112 (4 weeks after the last 
dose). PASI scores were determined at screening (8-56 days before dose), baseline 
(3-7 days before the dose), study days 14, 28, 56, 84 and 112 (4 weeks post the last 
dose), 196 and 364 (part 3 only). 

In part 2, study drug was administered on study days 1, 28 and 56. Skin biopsies 
were collected at baseline (3–7 days before drug administration on day 1) and study
day 84 (4 weeks after the last dose). PASI scores were evaluated at screening (8 to
56 days before study drug administration on day 1), baseline (as defined for parts
1 and 3) and study days 14, 28, 56, 63 and 112 (8 weeks after the last dose), 196, 224,
252, 280, 308, 336 and 364. 

Beyond study day 7, a 3-day window was allowed for scheduled visits. This window 
was considered acceptable owing to the exploratory, proof-of-concept nature of 
the study, the 21-day half-life of tildrakizumab, and the gradual time course for 
skin clearing. 

Safety and tolerability. Adverse events were recorded and vital signs were mon- 
itored throughout the study. Adverse events were evaluated for severity and rela- 
tionship to study drug by the investigators. The decision to discontinue therapy 
due to adverse events was made by the investigator. Safety assessments (including 
ECGs, clinical laboratory measurements, adverse event monitoring, collection of 
vital signs), and blood and urine assessments were performed throughout the study 
on the visit days. 

Pharmacokinetic parameters. Single and multiple dose tildrakizumab plasma 
concentrations and derived pharmacokinetic parameters including half-life, Tmax
(the time when Cmax is observed), Cmax and AUC were assessed. For part 2 only,
preliminary single and multiple dose proportionality was assessed using a one-way 
ANOVA model extracting the effect due to dose. Ratio estimate and 90% confi- 
dence intervals were provided for the higher dose versus the lower dose in part 2. 
Pharmacodynamic parameters. The primary efficacy end point was the percent- 
age change in PASI score from baseline at 4 or 8 weeks after the last treatment day 
(that is, week 16 for parts 1, 2 and 3). PASI is a commonly used index for the evaluation
of pharmacologic agents in the treatment of psoriasis.
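The end point described above can be illustrated with a short sketch. It assumes that percentage change is computed as (follow-up − baseline) / baseline × 100 and that a PASI75 or PASI90 response means a reduction of at least 75% or 90%; the function names and the example scores are hypothetical and are not taken from the study data.

```python
def percent_change(baseline: float, follow_up: float) -> float:
    """Percentage change from baseline; negative values mean improvement."""
    if baseline <= 0:
        raise ValueError("baseline PASI must be positive")
    return 100.0 * (follow_up - baseline) / baseline


def pasi_response(baseline: float, follow_up: float, threshold: float) -> bool:
    """True if PASI fell by at least `threshold` percent (75 for PASI75)."""
    return percent_change(baseline, follow_up) <= -threshold


def placebo_corrected(active_changes, placebo_changes) -> float:
    """Difference between the mean percentage change on active and on placebo."""
    mean_active = sum(active_changes) / len(active_changes)
    mean_placebo = sum(placebo_changes) / len(placebo_changes)
    return mean_active - mean_placebo


# Hypothetical subject: PASI 20 at baseline, PASI 4 at week 16.
change = percent_change(20.0, 4.0)
is_pasi75 = pasi_response(20.0, 4.0, 75)
is_pasi90 = pasi_response(20.0, 4.0, 90)
```

Keeping the sign convention explicit (negative means improvement) avoids confusion when the placebo-corrected difference is reported.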

Histology, immunohistochemistry and gene expression of skin biopsies. Four- 
millimetre punch biopsies were obtained in duplicate from lesional and non-lesional 
skin. For reasons of comparability, target sites of biopsy specimens were always 
located on trunk, thighs or arms for microscopic evaluations. Skin samples were 
either fixed in formalin and paraffin embedded or embedded in optimum cutting 
tissue compound (Tissue-Tek, Sakura Finetek Europe B.V.), snap-frozen in liquid 
nitrogen, and stored at −80 °C until further processing. Samples for histology
were cut from formalin-fixed paraffin-embedded blocks, stained with haematox- 
ylin and eosin, and analysed by light microscopy. Pre-dose and post-dose biopsies 
for histological and immunohistochemical analysis were analysed from five placebo-
treated patients, ten patients treated with 3 mg kg⁻¹ tildrakizumab, and seven
patients treated with 10 mg kg⁻¹. Immunohistochemical stainings of the study
patients were compared to stainings of normal human skin samples from a previous
study.

Histopathologic psoriasis severity score. Criteria used to calculate the HPSS were 
chosen on the basis of well-established histopathological criteria for the diagnosis 
of plaque-type psoriasis. Specifically, changes affecting the epidermis, involvement
of the blood vascular component and the amount and distribution of the
inflammatory cell infiltrate are assessed. The changes are graded according to severity 
into 0 (absent), 1 (mild), 2 (moderate) and 3 (severe), respectively. The individual 
scores for each main category (epidermal, vascular, inflammatory cells) are added 
to obtain a total HPSS. The HPSS can range from 0 to 36, while each subcategory 
can yield a score between 0 and 12, depending on the grading (0–3) of the individual
features of each category. Extended Data Table 5 highlights the individual subcategories
that sum up to each of the main categories. Histopathological analysis
was performed independently by two dermatopathologists in a blinded fashion 
and Bland-Altman plots were used to assess the agreement between readings of 
the HPSS of two independent observers. Bland–Altman analysis showed good
agreement between the two readers: the mean difference between the readings of
observer 1 and observer 2 was −0.3 (95% limits of agreement: −2.8 to 2.5) and
thus not significantly different from zero.
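The scoring scheme described above can be sketched as follows. The category and subcategory names follow Extended Data Table 5; the dictionary layout, the function name and the example grades are illustrative assumptions rather than the software used in the study.

```python
# Subcategories per main category, as listed in Extended Data Table 5.
HPSS_SUBCATEGORIES = {
    "epidermal": ("hyperkeratosis", "hyperplasia", "hypogranulosis", "mitosis"),
    "vascular": ("dilated_vessels", "elongated_vessels",
                 "increased_number", "contorted_vessels"),
    "inflammatory": ("perivascular_lymphocytes", "neutrophils_intraepidermal",
                     "neutrophils_intracorneal", "histiocytes_intradermal"),
}


def hpss_scores(grades):
    """Sum the 0-3 grades per main category (0-12 each) and overall (0-36)."""
    scores = {}
    for category, features in HPSS_SUBCATEGORIES.items():
        subtotal = 0
        for feature in features:
            grade = grades[category][feature]
            if grade not in (0, 1, 2, 3):
                raise ValueError(f"{category}/{feature}: grade must be 0, 1, 2 or 3")
            subtotal += grade
        scores[category] = subtotal
    scores["total"] = sum(scores.values())
    return scores


# Hypothetical biopsy reading (not study data).
example = {
    "epidermal": dict(zip(HPSS_SUBCATEGORIES["epidermal"], (3, 2, 2, 1))),
    "vascular": dict(zip(HPSS_SUBCATEGORIES["vascular"], (2, 2, 1, 1))),
    "inflammatory": dict(zip(HPSS_SUBCATEGORIES["inflammatory"], (2, 1, 1, 0))),
}
result = hpss_scores(example)
```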
Immunohistochemical staining. Immunohistochemistry was performed on frozen
tissue as described before. The purified monoclonal antibodies used for these
stainings are shown in Extended Data Table 6. In brief, after cutting, mounting and
fixing the tissue on slides, the primary antibodies were applied overnight in a humid
chamber at 4 °C. Using a staining kit (Vectastain Elite ABC kit, mouse IgG, Vector
Laboratories), sections were then incubated with a matching secondary antibody 
followed by incubation with an avidin-biotin complex for signal amplification. 
Finally, cells were visualized using 3-amino-9-ethylcarbazole (Sigma-Aldrich)
and counterstained with haematoxylin (Merck). 

Immunofluorescence staining. To determine the phenotype and number of infiltrating
leukocyte subsets, multicolour immunofluorescence stainings were performed
as described before. The monoclonal antibodies used and their sources
are shown in Extended Data Table 6. 

In brief, after incubation of the unconjugated primary antibody overnight at 
4 °C, slides were washed and an appropriate second step was applied (Rhodamine 
(TRITC)-conjugated AffiniPure F(ab′)₂ fragment goat anti-mouse IgG (H+L),
Jackson ImmunoResearch Laboratories, Northern Light 637-conjugated donkey 
anti-mouse IgG, R&D Systems or Alexa 633-conjugated donkey anti-goat IgG, 
Invitrogen, Molecular Probes). After blocking with normal mouse serum (DAKO) 
to diminish background problems, secondary antibodies, either biotinylated or 
fluorescence-labelled, were applied for at least 6 h at 4 °C. Sections were further
incubated with streptavidin TRITC (Jackson ImmunoResearch Laboratories), Strep- 
tavidin Alexa 488 (Invitrogen, Molecular Probes) or Streptavidin 637 NL (R&D 
Systems) and/or Oregon Green-labelled goat IgG anti-FITC (Invitrogen, Molecular 
Probes) to visualize antibody binding. Finally, nuclei were visualized by incubation 
with 4’,6-diamidino-2-phenylindole (DAPI). 

In all staining experiments, substitution of the antibodies with isotype-matched
immunoglobulin (purified, biotinylated or fluorescence-labelled) served as negative
controls.
Evaluation of immunohistochemical and immunofluorescence results. Data 
acquisition was performed by using the TissueFAXS technology (TissueGnostics), 
based on a motorized Zeiss Observer Z1 automated multichannel fluorescence 
microscope, as previously described.

HistoQuest and TissueQuest software (TissueGnostics) were applied for image
cytometry of immunohistochemical and immunofluorescence stainings, respec- 
tively. Positive cells in the epidermal and dermal compartments were analysed sep- 
arately. For measuring epidermal thickness, the epidermis was randomly marked 
at ten different sites. Thickness was then determined by calculating the mean value 
of these ten measurements. For evaluation of keratin 16, a score ranging from 0 
(healthy skin) to 4 (maximum psoriatic lesion) was used.

Confocal immunofluorescence microscopy (LSM 510, Zeiss) was used to fur- 
ther document immunofluorescence images at a higher magnification. 

Gene expression analysis. Whole-cell lysates were prepared from slides of form- 
alin-fixed paraffin-embedded tissue for analysis on the NanoString nCounter gene 
expression platform (NanoString Technologies). Before making the cell lysate, tissue 
sections were deparaffinized in xylene for 3 × 5 min and then rehydrated by immersing
consecutively in 100% ethanol for 2 × 2 min, 95% ethanol for 2 min and 70% ethanol
for 2 min, and then immersed in dH₂O until ready to be processed. Tissue was lysed
on the slide by adding 10–50 µl of PKD buffer. Tissue was scraped from the slide
and transferred to a 1.5 ml Eppendorf tube. Proteinase K was added at no more 
than 10% final volume and the RNA lysate was incubated for 15 min at 55 °C and
then 15 min at 80 °C. The RNA lysate was stored at −80 °C until gene expression
profiling was performed using the NanoString nCounter system on the following 
genes: CAMP, CCL20, CCR6, CXCL8, DEFB104A (also known as DEFB4), IFNG, 
IL12B, IL17A, IL17C, IL17F, IL19, IL20, IL22, IL23p19 (also known as IL23A), 
IL23R, IL24, LCN2, RORC and S100A7. 

Statistical methods. Subjects were randomized in this study to balance the pre- 
conditions of the subjects in the different treatment groups. This study was designed 
to enroll approximately 64 subjects (24 in part 1, and 40 in part 2). Additional 
subjects were to be enrolled at the discretion of the study sponsor to explore further 
interim doses or to increase the power of the statistical analyses. The sample size for 
parts 1 and 3 was determined by practical considerations only as it was designed to 
assess safety. In part 2, subjects were assigned to each of the dose levels in a 3:1 ratio 
(that is, 15 subjects on active and 5 on placebo in each of the dose groups). No 
statistical methods were used to predetermine sample size. 

With 15 subjects on tildrakizumab and 16 (6 + 5 + 5) subjects on placebo, the 
study was designed to detect a difference of approximately 30% in the mean change 
from baseline PASI score compared with the placebo group. This assumes a 0% 
mean change from baseline in the placebo group and a standard deviation of 40%,
with 80% power at an alpha level of 0.1 (one-sided test). Pooling of subjects was
pre-planned to increase the power of the statistical test.
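The stated power can be checked approximately with a normal-approximation sketch for a two-sample comparison of means. This is an assumption about the form of the calculation, since the text does not specify the exact method (a t-based calculation would give a slightly lower value); the function name is hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def approx_power(n_active, n_placebo, delta, sd, alpha):
    """One-sided power to detect a mean difference `delta` (normal approximation)."""
    standard_error = sd * sqrt(1.0 / n_active + 1.0 / n_placebo)
    z_critical = NormalDist().inv_cdf(1.0 - alpha)
    return NormalDist().cdf(delta / standard_error - z_critical)


# Values taken from the text: 15 active, 16 placebo, 30% difference,
# 40% standard deviation, one-sided alpha of 0.1.
power = approx_power(n_active=15, n_placebo=16, delta=30.0, sd=40.0, alpha=0.1)
print(f"approximate power: {power:.2f}")
```

Under these assumptions the result is a little under 0.8, in line with the 80% power quoted.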

The percent change in PASI score was analysed using a one-way ANOVA model 
extracting the effect due to treatment. The mean and 95% confidence interval were
provided for each treatment group as well as for the difference between tildrakizumab
and placebo. Durability of effect was evaluated in an exploratory and graphical 
fashion. 

For HPSS analysis, statistical comparisons between psoriasis patients before and
after treatment with tildrakizumab were performed using the non-parametric
Wilcoxon matched-pairs signed-rank test calculated using GraphPad Prism
software. Bland–Altman plots were used to assess the agreement between readings
of the HPSS of two independent observers. The data are presented as the mean and
s.e.m.
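The agreement statistics reported for the two HPSS readers can be sketched as below. The sketch assumes the conventional Bland–Altman definition (mean difference ± 1.96 standard deviations of the paired differences); the readings shown are hypothetical, not study data.

```python
from statistics import mean, stdev


def bland_altman(reader_a, reader_b):
    """Return (bias, lower limit, upper limit) for paired readings."""
    if len(reader_a) != len(reader_b):
        raise ValueError("readings must be paired")
    differences = [a - b for a, b in zip(reader_a, reader_b)]
    bias = mean(differences)
    half_width = 1.96 * stdev(differences)  # sample standard deviation
    return bias, bias - half_width, bias + half_width


# Hypothetical HPSS readings from two observers on the same four biopsies.
observer_1 = [10, 12, 14, 16]
observer_2 = [11, 12, 13, 17]
bias, lower, upper = bland_altman(observer_1, observer_2)
```

A bias close to zero with narrow limits indicates that the two readers can be used interchangeably.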

For immunohistological and immunofluorescence analysis, statistical comparisons
between psoriasis patients before and after treatment with tildrakizumab
were performed using the non-parametric Wilcoxon matched-pairs signed-rank
test calculated using GraphPad Prism software. The data are presented as
the mean and s.e.m.

For gene expression analysis, statistical comparisons between psoriasis patients
before and after treatment with tildrakizumab were performed using
the non-parametric Wilcoxon matched-pairs signed-rank test calculated using
GraphPad Prism software.
Extended Data Figure 1 | Tildrakizumab binds to IL-23, but not IL-12.
a, b, Biacore sensorgrams showing the 37 °C binding kinetics of human IL-23
(a) and human IL-12 (b) to immobilized tildrakizumab as a threefold dilution
series from 0.091 to 22.2 nM. The association phase was measured for 220 s
followed by a dissociation phase measurement for 1,800 s. The affinity of
tildrakizumab for IL-23 is ~300 pM.
Extended Data Figure 2 | Study flow chart.
Extended Data Table 1 | Baseline patient demographics

             0.05 mg/kg  0.1 mg/kg      0.5 mg/kg  3 mg/kg    3 mg/kg   10 mg/kg  10 mg/kg  Placebo           Total
             (Part 3)    (Parts 1 & 3)  (Part 1)   (Part 1)*  (Part 2)  (Part 1)  (Part 2)  (Parts 1, 2 & 3)
             n=6         n=6            n=3        n=7        n=15      n=6       n=14      n=20              n=77
Sex (n, %)

Female 2 (33) 1 (17) 1 (33) 5 (33) 2 (33) 1 (7) 16 (21)
Male 4 (67) 5 (83) 2 (67) 7 (100) 10 (67) 4 (67) 13 (93) 16 (80) 61 (79)

Race (n,%) 

White 6 (100) 15 (100) 19( 95) 69( 90) 
American Indian 0 0 2( 3) 
or Alaskan Native 

Asian 3( 4) 
Black or African 3( 4) 
American 
Age (yrs) 

Mean (SD) 45.0 (9.1) 48.2 (11.7) 47.3 (11.9) 52.7 (6.2) 49.3 (11.7) 46.2 (11.1) 46.0 (14.4) 45.5 (11.6) 47.3 (11.4) 

Range 33-55 31-62 34-57 42-61 22 - 63 32 - 65 22 - 64 25-61 22-65 
Age (n,%) 

18 - <65 6( 100) 6( 100) 3 (100) 7( 100) 15 (100 ) 14( 100) 20 ( 100 ) 76( 99) 


65 or Older 0 0 0 0 0 0 1 (1)


Weight (kg)

Mean (SD) 96.88 (20.37) 95.85 (23.49) 107.63 (24.58) 90.27 (10.96) 96.51 (10.87) 94.25 (25.88) 95.69 (20.46) 102.46 (25.49) 97.67 (20.23)


* Includes one subject that was randomized but never treated. 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Table 2 | Subjects on tildrakizumab who achieved PASI75 and PASI90 on day 112

                    Parts 1 and 3 pooled                                          Part 2
Tildrakizumab dose  0.05 mg/kg  0.1 mg/kg  0.5 mg/kg  3 mg/kg     10 mg/kg   3 mg/kg      10 mg/kg
                    n/N (%)     n/N (%)    n/N (%)    n/N (%)     n/N (%)    n/N (%)      n/N (%)
PASI 75             2/5 (40%)   3/6 (50%)  1/3 (33%)  6/6 (100%)  3/5 (60%)  10/15 (66%)  13/14 (93%)
PASI 90             2/5 (40%)   1/6 (17%)  0/3 (0%)   5/6 (83%)   1/5 (20%)  7/15 (47%)   10/14 (71%)
Extended Data Table 3 | Summary of adverse events occurring in 2 or more subjects within a treatment group

Adverse event  0.05 mg/kg  0.1 mg/kg      0.5 mg/kg  3 mg/kg  3 mg/kg  10 mg/kg  10 mg/kg  Placebo    Total
               (n=6)       (n=6)          (n=3)      (n=7)    (n=15)   (n=6)     (n=14)    (n=20)     (n=77)
               Part 3      Parts 1 and 3  Part 1     Part 1   Part 2   Part 1    Part 2    All parts
No. of subjects reporting any adverse event (%) 6 (100) 4 (67) 3 (100) 5 (71) 15 (100) 2 (33) 12 (86) 15 (75) 62 (81)
Headache 1 (17) 1(17) 1(33) 1 (14) 4 (27) 0 3 (21) 3 (15) 14 (18) 
Upper respiratory tract infection 1 (17) 2 (33) 1 (33) 0 2 (13) 0 5 (36) 3 (15) 14 (18)
Cough 0 1(17) 0 1 (14) 4 (27) 0 3 (21) 3 (15) 12(16) 
Nasopharyngitis 0 1 (17) 0 1(14) 4 (27) 0 4 (29) 2 (10) 12 (16) 
Arthralgia 0 0 0 1 (14) 1 (7) 0 1 (7) 3 (15) 6 (8) 
Back pain 0 0 0 1 (14) 2 (13) 0 0 1 (5) 4 (5) 
Blood creatine phosphokinase increased 0 0 0 0 2 (13) 0 2 (14) 0 4 (5)
Hypertension 2 (33) 0 0 1(14) 0 0 0 1 (5) 4 (5) 
Oropharyngeal pain 0 0 0 0 2 (13) 0 1(7) 1 (5) 4 (5) 
Rhinitis 1(17) 0 0 0 2 (13) 0 1 (7) 0 4 (5) 
Eczema 0 0 0 0 2 (13) 0 1 (7) 0 3 (4) 
Fatigue 0 0 0 0 1 (7) 0 0 2 (10) 3 (4) 
Pruritus 0 0 0 0 2 (13) 0 0 1 (5) 3 (4) 
Sinusitis 0 0 0 0 0 0 0 3 (15) 3 (4) 
Insomnia 0 2 (33) 0 0 0 0 0 0 2 (3) 
Leukopenia 0 0 0 0 2 (13) 0 0 0 2 (3) 
Paraesthesia 0 0 0 0 2 (13) 0 0 0 2 (3) 
Psoriasis 0 0 0 0 0 0 0 2 (10) 2 (3) 
Sciatica 0 0 0 0 2 (13) 0 0 0 2 (3) 
Extended Data Table 4 | Summary of serious adverse events

Subject  SAE                        Treatment                 Relationship to treatment
1        Upper limb fracture        0.05 mg/kg tildrakizumab  Unrelated
2        Ankle fracture             0.05 mg/kg tildrakizumab  Unrelated
3        Pneumothorax               3 mg/kg tildrakizumab     Unrelated
4        Meniscal abrasion          3 mg/kg tildrakizumab     Unrelated
5*       Cardiac enzymes increased  3 mg/kg tildrakizumab     Unrelated
         Myositis
         Cytomegalovirus infection
6        Convulsion                 10 mg/kg tildrakizumab    Possibly related**
7        Pulmonary embolism         10 mg/kg tildrakizumab    Unrelated
         Pulmonary hypertension
8        Facial bones fracture      10 mg/kg tildrakizumab    Unrelated

* Subject 5 had a serious adverse event (SAE) of increased cardiac enzymes (44 weeks after treatment) and myositis (58 weeks after treatment); treatment of the condition with prednisone likely contributed to the cytomegalovirus infection (59 weeks after treatment). The primary investigator did not consider these serious adverse events to be related to study medication.
** In subject 6, the convulsion event was confounded by the patient’s recent history of alcohol abuse, acute alcohol and benzodiazepine withdrawal as well as lack of sleep preceding the event. The investigator considered the event to possibly be related to the study drug owing to its long half-life; the event occurred 17 days after the second dose of 10 mg kg⁻¹ tildrakizumab.
Extended Data Table 5 | The histopathologic psoriasis severity score

Main categories     Subcategories (grading 0–3 each)                                                                  Score
Epidermal changes   Hyperkeratosis            Hyperplasia                 Hypogranulosis            Mitosis                  0–12
Vascular changes    Dilated vessels           Elongated vessels           Increased number          Contorted vessels        0–12
Inflammatory cells  Perivascular lymphocytes  Neutrophils intraepidermal  Neutrophils intracorneal  Histiocytes intradermal  0–12
Total HPSS                                                                                                                   0–36
Extended Data Table 6 | Antibodies used for immunohistochemistry and immunofluorescence

Antigen          Supplier                Clone      Isotype  Catalogue no.  Reference
IL23p19 pur.     Merck                   12F12      mIgG2a   -              Wilson NJ et al 2007, Nat Immunol. 8(9):950-7
CD31 pur.        Dako                    JC70A      mIgG     M0823          Parums DV et al 1990, J Clin Pathol 43:572-577
Ki-67 pur.       Dako                    MIB-1      mIgG     MIB-1          Gerdes J et al 1992, J Pathol; 168:85-6
Keratin 16 pur.  Neomarkers              LL025      mIgG     MS-620-PO      Wetzels RH et al 1991, Am J Pathol; 138:751-763
CD3 pur.         Beckman Coulter         UCHT1      mIgG     IM1304         Thibault G et al 1995, J Immunol; 154: 3814-3820
CD8 pur.         Dako                    C8/144B    mIgG     M7103          Mason DY et al 1992, J Clin Pathol; 45: 1084-88
CD4 pur.         Dako                    MT310      mIgG     M0716          Leong AS-Y et al 1999; London: Oxford University Press: 49-50
CD11c pur.       IOTest/Beckman Coulter  BU15       mIgG     IM0712         Kohrgruber N et al J. Immunol; 163: 3250-3259
BDCA-2 pur.      Dendritics              104C12.08  mIgG     DDX0041        Stary G et al 2009, Blood; 114: 3854-3863
Langerin pur.    IOTest/Immunotech       DCGM4      mIgG     IM3449         Valladeau J et al 2000, Immunity; 12: 71-81
CD117 APC        Immunotech              104D2D1    mIgG     IM3638         Uoshima N et al 1995, Br. J. Haematol; 91: 30-36
CD11c biot       Biolegend               3.9        mIgG     301612         Gurer C et al 2008, Blood; 112:1231-1239
CD15 biot        Novus Biologicals       HI98       mIgM     NBP1-43673     Lund-Johansen F et al 1992, J Immunol. 148: 3221-3229 *
CD163 FITC       Acris                   5C6-FAT    mIgG     BM4041F        Zaba LC et al 2007, J Exp Med; 204:3183-3194

* Novus Biologicals was only founded in summer 1996. Publication cited refers to the same clone (HI98) from a different company.
Nuclear architecture dictates HIV-1 integration site selection


Bruna Marini, Attila Kertesz-Farkas, Hashim Ali, Bojana Lucic, Kamil Lisek, Lara Manganaro, Sandor Pongor,
Roberto Luzzati, Alessandra Recchia, Fulvio Mavilio, Mauro Giacca & Marina Lusic


Long-standing evidence indicates that human immunodeficiency 
virus type 1 (HIV-1) preferentially integrates into a subset of 
transcriptionally active genes of the host cell genome. However,
the reason why the virus selects only certain genes among all 
transcriptionally active regions in a target cell remains largely 
unknown. Here we show that HIV-1 integration occurs in the outer 
shell of the nucleus in close correspondence with the nuclear pore. 
This region contains a series of cellular genes, which are preferentially 
targeted by the virus, and characterized by the presence of active 
transcription chromatin marks before viral infection. In contrast, 
the virus strongly disfavours the heterochromatic regions in the 
nuclear lamin-associated domains and other transcriptionally active
regions located centrally in the nucleus. Functional viral integrase 
and the presence of the cellular Nup153 and LEDGF/p75 integration 
cofactors are indispensable for the peripheral integration of the 
virus. Once integrated at the nuclear pore, the HIV-1 DNA makes 
contact with various nucleoporins; this association takes part in the 
transcriptional regulation of the viral genome. These results indicate 
that nuclear topography is an essential determinant of the HIV-1 
life cycle. 

One important aspect of the interaction between HIV-1 and its target 
cells is the encounter between the viral complementary DNA (cDNA) 
with the complex architecture of the mammalian nucleus, in which 
chromosomes and genes are spatially arranged to occupy preferred positions
within the three-dimensional space.

We analysed the lists of human genes targeted by HIV-1 from six 
different studies (Extended Data Table 1), containing altogether 1,136 
unique gene integration sites in activated T cells carrying the CD4 antigen
(CD4⁺); 126 of these genes recurred in two lists, 24 in three, and six
in at least four lists, for a total of 156 genes, which we named HIV recurrent
integration genes (RIGs). The probability of detecting this number
of specific genes by chance was extremely low (P< 1 X 10°; Extended 
Data Fig. 1a). RIGs were also highly represented in another list of approximately
12,000 integration sites, 5,221 of which were unique genes, as
well as in two integration lists generated from patients’ CD4⁺ T cells
(P < 0.001 of detecting these genes by chance). Thus, RIGs are bona
fide the hottest spots of HIV-1 integration.
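The way recurrent genes are tallied across published lists can be sketched as follows. The sketch assumes that a gene counts once per list regardless of how many integration sites it carries in that list; the function name and the gene labels are placeholders, not entries from the six studies.

```python
from collections import Counter


def recurrent_genes(gene_lists, min_lists=2):
    """Map each gene found in at least `min_lists` lists to its list count."""
    counts = Counter(gene for genes in gene_lists for gene in set(genes))
    return {gene: n for gene, n in counts.items() if n >= min_lists}


# Placeholder gene labels standing in for three published integration lists.
lists = [["A", "B", "C"], ["B", "C", "D"], ["C", "E"]]
recurrent = recurrent_genes(lists)

# Consistency check on the counts quoted in the text.
total_rigs = 126 + 24 + 6
```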

We then ranked RIGs according to their frequency and plotted them 
onto the human chromosome map. Unexpectedly, they appeared to
cluster into specific chromosomal regions (Extended Data Fig. 1b). In 
five out of eight cases, RIGs were also in proximity to the ‘hotter zones’, 
previously defined as regions with remarkably high HIV-1 integration 
density (Supplementary Table 1). In these areas, observations hinted
at the possibility that the topological distribution of these chromosomal 
regions inside the nucleus could determine HIV-1 integration. 


By applying three-dimensional immuno-DNA fluorescence in situ 
hybridization (FISH), we assessed the position of RIGs and hotter zones 
in primary CD4⁺ T cells from healthy donors. Selected FISH probes,
listed in the Supplementary Information, provided topological infor- 
mation for a total of 169 RIGs and other integration sites located within 
10 megabases (Mb) from the centre of the probe (Extended Data Fig. 2). 

When the radial positions of the RIG FISH signals were binned into
three zones of equal area (Fig. 1a), a clear gradient in signal localization
was observed, which decreased from the nuclear envelope towards
the interior (images of 14 RIGs in Fig. 1b, c; four hotter zones in Fig. 1d).
The global distribution of RIGs (n = 1,420 analysed alleles) was remarkably
different from that of control genes, all of which were expressed in
CD4⁺ T cells (n = 522): 44% of RIGs mapped in zone 1, 41.5% in
zone 2 and only 14.5% in zone 3 versus 25.6%, 47.6% and 26.8% for control
genes, respectively (Fig. 1f; representative images of control genes
are shown in Fig. 1e). Considering an average of about 7 µm for the
nuclear diameter in CD4⁺ T cells, 63% of RIGs and hotter-zone alleles
were concentrated within about 1 µm below the nuclear membrane.
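The binning into concentric zones of equal area can be sketched as below. The sketch assumes a circular nuclear cross-section, signal positions expressed as a fraction of the nuclear radius, and zone 1 as the outermost zone (consistent with the peripheral enrichment reported for zone 1); the function names are hypothetical.

```python
from math import sqrt


def zone_boundaries(n_zones=3):
    """Relative radii that split a disc into `n_zones` rings of equal area."""
    return [sqrt(k / n_zones) for k in range(1, n_zones)]


def assign_zone(relative_radius, n_zones=3):
    """Zone number for a signal; zone 1 is the most peripheral."""
    if not 0.0 <= relative_radius <= 1.0:
        raise ValueError("relative radius must lie between 0 and 1")
    crossed = sum(relative_radius >= b for b in zone_boundaries(n_zones))
    return n_zones - crossed


boundaries = zone_boundaries()  # inner and outer ring boundaries
zones = [assign_zone(r) for r in (0.9, 0.7, 0.3)]
```

Because area grows with the square of the radius, equal-area rings become thinner towards the periphery, so a uniformly distributed signal would fall into each zone with equal probability.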

We wanted, therefore, to visualize the position of the HIV-1 DNA itself 
in infected, primary CD4⁺ T-cell nuclei. At 4 days after infection with
the VSV-G-pseudotyped HIV-1 NL4-3/E−R−, the vast majority of the proviral
immuno-FISH signals were in zone 1 (75.2% within 1 µm under
the nuclear envelope) (Fig. 2c). The visualized viral DNA was integrated,
as also detected by real-time Alu PCR (Fig. 2a), and transcriptionally 
active (Fig. 2b). A similar distribution was observed in primary macro- 
phages and the monocytic cell line U937 (Extended Data Fig. 3a, b, respec- 
tively). Peripheral localization was also observed for a fully competent 
virus carrying the HIV-1gry envelope’* (Fig. 2d) and, notably, for the 
wild-type viruses found in CD4⁺ T cells from two HIV-infected patients
(Fig. 2e, f). Peripheral localization was also a feature of lentiviral vectors, 
irrespective of their transcriptional activity (Extended Data Fig. 3c, d), 
but not of the MoMLV gammaretrovirus, which localized preferen- 
tially inside the nuclear interior (Extended Data Fig. 3e). 

In contrast, when integration was impaired, the viral cDNA roamed 
around the nucleus. This was the case for two HIV-1 clones harbouring 
single-point mutations in the integrase catalytic domain (class I IN 
mutations: IN(D64E) and IN(D116N)) or for HIV-1 NL4-3/E−R− in the
presence of the integrase inhibitor raltegravir; under these conditions, 
only 10-20% of viruses were found in zone 1 (Fig. 2g). In these cases, the 
detected viral genomes did not correspond to integrated DNA (Fig. 2h) 
but were highly enriched in circular forms of viral DNA containing two 
long terminal repeats (2-LTR circles) (Fig. 2i). We also downregulated 
the chromatin tethering factor LEDGF/p75 (ref. 19) and the inner nuclear 
basket protein Nup153 (ref. 20), which are involved in viral DNA inte- 
gration (Fig. 2j). FISH was performed 48 h after infection when there 
Figure 1 | Localization of HIV RIGs at the nuclear periphery. a, Subdivision
of nucleus into three concentric zones of equal area. b–e, Three-dimensional
immuno-DNA FISH of ten HIV RIGs (b, c), four hotter zones (d) and
seven control genes (e) in activated CD4⁺ T cells (green: bacterial artificial
chromosome (BAC) probe labelled with DIG (dUTP-digoxygenin) and FITC
(fluorescein isothiocyanate); red: NPC staining by mAb414). Below each
representative image, the distribution of the analysed alleles into the three


was a marked reduction in HIV-1 integration (Fig. 2k), and the majority 
of the signals labelled unintegrated viral DNA: 72% and 77% of the
FISH signals were in zones 2 and 3 for the LEDGF/p75 and Nup153 
knockdowns, respectively (Fig. 21). The effect of the Nup153 knock- 
down was rescued by transfecting an expression plasmid coding for an 
RNA interference (RNAi)-resistant Nup153 (Extended Data Fig. 4). 

Most of the HIV-1 targets are common in different cell types; how- 
ever, subtle differences exist. For example, HIV-1 almost never targets 
the IKZF3 locus in CD34* haematopoietic stem cells”! (P< 1 10° |”), 
whereas the TAP2 gene from the major histocompatibility complex class 
II locus is never targeted in CD4* T cells (P< 1X 10"). Strikingly, 
we observed that IKZF3 localized in zones 1 and 2 in peripheral blood 
CD4+ T cells (>80% of alleles), while it was almost absent from zone 1 
in cord blood CD34+ cells (<6% of alleles; P < 0.001). Conversely, the 
TAP2 locus was absent from zone 1 in CD4+ T cells (<8% of alleles), 
while it was distributed between zones 1 and 2 in CD34+ cells (>90% 
of alleles; P < 0.001; Fig. 3a). 

To understand the chromatin features of RIGs, we compared the available 
data from chromatin immunoprecipitation sequencing (ChIP-seq) 
obtained in CD4+ T cells for RIGs, cold genes (defined as transcriptionally 
inactive genes never targeted by HIV-1; F.M. and A.R., unpublished 
observations) and a list of genes corresponding to the 1,000 most 
expressed (active) and 1,000 least expressed (silent) genes from the GNF 
SymAtlas. Association of RNA Pol2 with RIGs had a pattern superimposable 
on that of active genes, peaking at the transcription start sites 
(TSSs) (Fig. 3b). In a similar manner, distribution of markers of active 

nuclear zones is shown, normalized over nuclear radius. Evenly distributed 
random genes would be enriched equally in the three zones (red dashed line). 
The number of alleles analysed is shown at the bottom of each panel. HZ, hotter 
zone. f, Distribution of the relative distances of all measured alleles from the 
nuclear envelope (HIV RIGs and hotter zones: n = 1,420; control genes: 
n = 522). The three zones are shown by grey shading. The dashed line indicates 
approximately 1 µm from the nuclear edge of the T-cell nucleus. 


transcription (H3K9ac, H3K36me3, H3K4me3, H4K16ac and H4K20me) 
was identical for RIGs and active genes (Fig. 3c-e and Extended Data 
Fig. 5). In contrast, markers of facultative (H3K9me2) and constitutive 
(H3K9me3 and H3K27me3) heterochromatin were found enriched both on 
cold genes (where HIV-1 never integrates) and on silent genes, but not 
on RIGs (Extended Data Fig. 5). Of interest, active genes and RIGs had 
a superimposable distribution of H3K4me2, which is enriched at the 
lamin-associated domain (LAD) borders (Fig. 3f). 

Heterochromatic LADs contain approximately 4,000 transcriptionally 
inactive genes. We found that more than 90% of HIV RIGs lay outside 
LADs, while almost 80% of cold genes were inside LADs (P < 0.001 
compared with a random gene distribution; Fig. 3g). Immuno-FISH 
images for three of these cold genes confirmed their localization close 
to the nuclear envelope in primary CD4+ T cells (Fig. 3h). Finally, when 
all the 1,344 known LADs were aligned by their left or right borders, 
87.2% of RIGs were found outside the LADs, in contrast to a random 
distribution of genes (68.2%; P < 0.001, also taking into account the lower 
gene density within LADs; Fig. 3i). 

Transcriptionally active genes at the nuclear periphery are often associated 
with the nuclear pore complex (NPC). We therefore assessed 
interaction of the HIV-1 provirus with the NPC by ChIP assays in primary 
CD4+ T cells (primer scheme and controls in Extended Data Fig. 6a, b). 
At 4 days after infection, when RNA Pol2 and the USF1 and p65/RelA 
transcription factors were associated with the viral DNA as expected, 
both the mAb414 antibody, which recognizes phenylalanine-glycine 
(FG)-repeats in nucleoporins, and specific antibodies against Nup153, 

Figure 2 | Integrated, transcriptionally active HIV-1 is found at the nuclear 
periphery. a, b, Quantification of integrated HIV-1NL4-3/E-R- DNA (a) and 
HIV RNA (b) by real-time Alu PCR in infected CD4+ T cells. c-f, Three-dimensional 
immuno-DNA FISH of HIV-1 DNA (green) in primary CD4+ 
T cells infected ex vivo with HIV-1NL4-3/E-R- (c) and HIV-1BRU (d), or directly 
obtained from two patients infected with HIV-1 (e, f). g, Three-dimensional 
immuno-DNA FISH of HIV-1 DNA in activated CD4+ T cells infected with 
the mutant viruses IN(D64E) or IN(D116N) or with HIV-1NL4-3/E-R- upon 
raltegravir treatment. h, i, Real-time Alu PCR (h) and 2-LTR quantification (i) 
in the cells treated as in g. ND, not determined. j, Western blot (WB) showing 




Nup98, Nup62 and Tpr all immunoprecipitated the HIV-1 DNA; bind- 
ing was also observed for the NPLOC4 RIG gene, but not for the LAD 
gene PTPRD (Extended Data Fig. 6c). When ChIP was performed on the 
IN-defective D64E virus, no viral DNA was detected using the mAb414 
and anti-Nup153 antibodies (Extended Data Fig. 6d). 

Next, we aimed to verify whether HIV-1 localization changed when 
the virus reverted from a transcriptionally inactive to an active state. 
In the latent T-cell J-Lat clone 15.4 (ref. 28), the HIV-1 DNA retained 
its gross peripheral localization both in inactive and in TPA (12-O- 
tetradecanoylphorbol-13-acetate) phorbol ester-reactivated conditions 
(Extended Data Fig. 7a, b). Similar results were obtained in a primary 
model of HIV-1 latency (ref. 14; Extended Data Fig. 7c-e). However, when 
localization was analysed at molecular resolution by ChIP using the 
mAb414, anti-Tpr and anti-Nup153 antibodies, binding of the proviral 
region located downstream of the TSS to the nucleoporins was observed 
upon transcriptional activation but not in latent conditions (Extended 
Data Fig. 7f). We also observed that nucleoporins directly participated 
in HIV-1 transcriptional regulation. When Tpr and Nup153 were silenced 

protein levels for LEDGF/p75 and Nup153 at the moment of HIV-1 infection, 
36 h after short interfering RNA (siRNA) transfection. NT2, non-targeting 
siRNA. k, Real-time Alu PCR in Jurkat cells infected with HIV-1NL4-3 and 
previously transfected with a non-targeting siRNA (NT) or an siRNA targeting 
LEDGF/p75. Samples were normalized over control-infected cells. l, Three-dimensional 
immuno-DNA FISH for HIV-1 DNA visualization upon Jurkat 
cell treatment with the indicated siRNAs. NT2 and NT5 are two non-targeting 
siRNAs. All graphs, except those relative to patients’ cells, show mean and 
s.e.m. of at least three independent experiments. 


by RNAi in latent J-Lat cells, proviral transcription was significantly 
reduced (Extended Data Fig. 7g, h). Similarly, downregulation of Tpr 
also blunted LTR-driven gene expression in HIV-1-infected HeLa cells 
(Extended Data Fig. 8a-e). 

Our findings show that the cellular genes that are highly targeted by 
HIV-1 are distributed in a topologically non-random manner, being 
positioned within 1 µm from the nuclear edge; these genes are enriched 
in open chromatin marks, excluded from the LADs and associated with 
the NPC. Thus, the HIV-1 pre-integration complex preferentially targets 
those areas of open chromatin that are proximal to the nuclear pore, 
while excluding the internal regions in the nucleus as well as the peri- 
pheral regions associated with the nuclear lamina (model in Extended 
Data Fig. 9). The localization of HIV-1 proviral DNA in close association 
with the nuclear pore is consistent with several observations showing 
that different NPC components play a role in HIV-1 infection. 

Why does the viral DNA integrate into the NPC compartment? A 
possibility that we favour is that the virus simply integrates into the first 
open chromatin regions it meets along its route into the nucleus. This is 

Figure 3 | HIV RIGs are transcriptionally active genes that are excluded 
from the LADs. a, Localization of the IKZF3 (a) and TAP2 (b) genes (green) in 
CD4+ T cells and CD34+ haematopoietic stem cells. b-f, Distributions of Pol2, 
acetylated H3K9, H3K36me3 and H3K4me2/3 around the TSSs of HIV 
RIGs (red) and cold genes (green), compared with highly active (black) and 
silent (blue) genes in activated CD4+ T cells. g, Cross-comparison of different 
lists of integration loci, including HIV RIGs, with the lists of genes present 
inside LADs: HIV integration loci are significantly depleted in LADs compared 


likely to be related to the short life of viral integrase and thus the need 
for the pre-integration complex to achieve rapid integration into genomic 
DNA upon its entry into the nucleus. This interpretation is consistent 
with our observation of more dispersed, unintegrated viral cDNA 
in all conditions in which integrase function is impaired. 

Finally, while adding a three-dimensional view to the process of HIV-1 
integration, our results also indicate that the localization of the HIV-1 
DNA in close correspondence with the nuclear pore has functional rele- 
vance, since it appears important for productive HIV-1 gene expression. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS 

Virus production. For the production of viral stocks, we used a plasmid obtained 
from the Env- molecular clone pNL4-3/E-R-, a gift from N. Landau. This viral 
clone harbours a frameshift mutation introduced near the 5′ end of the env gene 
and performs a single-round infection once pseudotyped with vesicular stomatitis 
virus-G (VSV-G); this renders the virus incapable of spreading (and causing massive 
cell death). 

We produced viral clone HIV-1BRU as previously described. The integrase (IN) 
defective packaging plasmid pcHelp/IN-, a gift from A. Cara, contains a D116N 
mutation in the IN gene, preventing the function of the IN protein. The D64E 
mutant plasmid, which is similarly integration-defective, was obtained from the 
National Institutes of Health (NIH) AIDS Research and Reference Reagent Program. 

Lentiviral vector pLV-THM was obtained from Addgene, whereas 
pCCL18-green fluorescent protein (GFP) was modified from ref. 33 to become 
promoter-less. The gammaretroviral MoMLV vector was a gift from G. Towers. 

Infectious viral stocks were generated by transfecting viral DNA in HEK 293T 
cells and collecting supernatants after 48 h. Viral production was quantified by 
measuring viral p24 in the supernatants using the Innotest HIV antigen mAb kit 
(Innogenetics). 
Primary cell isolation, culture and infection. Primary human CD4+ T cells were 
isolated by Ficoll gradient separation, followed by purification with CD4 MicroBeads 
(Miltenyi Biotec). Cells were activated with a cocktail of beads containing 4.5 X 10° 
beads coated with αCD3 and αCD28 antibodies (Dynabeads Human T-Activator 
CD3/CD28, Dynal/Invitrogen), and plated in complete medium with interleukin-2 
(IL-2; 30 U ml−1, Sigma-Aldrich) for 4 days at 37 °C. 

Activated CD4+ T cells (1 X 10°) were infected with 0.5-0.75 µg ml−1 of viral 
p24 for 4-5 h at 37 °C. After infection, cells were kept in culture at 1 X 10° cells per 
millilitre in complete RPMI 1640 medium supplemented with IL-2 and CD3/CD28 
beads. 

For raltegravir treatment, 10 µM raltegravir (obtained from the NIH AIDS Research 
and Reference Reagent Program) was added together with the virus during the 
infection, and it was later supplemented in the medium. 

For generating the primary model of latency, naive CD4+ T cells were isolated, 
cultured and infected as described in ref. 14. 

Primary human CD34+ cells were isolated, cultured and infected as described in 
ref. 34. Primary human macrophages were isolated and cultured as described in 
ref. 35. 

Patients infected with HIV and having a CD4 count less than 3 X 10° ml‘ were 
enrolled before starting highly active antiretroviral therapy, following informed consent. 
Peripheral blood mononuclear cells obtained from healthy or infected blood 
donors according to a study protocol approved by the Ethical Committee of the 
Azienda Ospedaliero-Universitaria ‘Ospedali Riuniti di Trieste’, Italy, were isolated 
as described previously. 

Cell culture and transfection. The Jurkat lymphoblastoid cell line, Jurkat J-Lat 
15.4 clone and U937 monocytic cell line were kept in culture in complete RPMI 
1640 medium with the addition of 10% fetal bovine serum (FBS). Cells were tested 
for mycoplasmal cell culture contaminants by using a MycoAlert kit from Lonza. 

Transfection of Jurkat cells with the p-eGFP-Nup153 expression plasmid obtained from 
Euroscarf was done using Fugene HD (Promega), according to the manufacturer’s 
instructions. For the RNAi experiments, 3 X 10° cells were transfected with siRNA 
smart pools targeting LEDGF/p75 (PSIP1) (Dharmacon M-015209) or Nup153 
(Dharmacon M-005283) proteins or with a non-targeting (NT) siRNA as a nega- 
tive control (Dharmacon, Thermo Scientific). Transfection was performed with 
the Amaxa Nucleofection Device II (Amaxa), using an Amaxa nucleofection Kit V 
according to the manufacturer’s instructions. 

For western blot analysis, cells were harvested and homogenized in lysis buffer 
(20 mM Tris-HCl, pH 7.4, 1 mM EDTA, 150 mM NaCl, 0.5% Nonidet P-40, 0.1% 
SDS, 0.5% sodium deoxycholate) supplemented with protease inhibitors (Roche) 
for 10 min at 4 °C and sonicated (Bioruptor) for 5 min. Equal amounts of total cellular 
proteins (30 µg), as measured with Bradford reagent (Biorad), were resolved 
by 8% SDS-polyacrylamide gel electrophoresis (SDS-PAGE), transferred onto polyvinylidene 
difluoride membranes (GE Healthcare) and then probed with primary 
antibody (anti-Nup153 (Santa Cruz, sc-101544), anti-LEDGF/p75 (BD Biosciences, 
611714), anti-GFP (LifeTechnologies, A6455)), followed by secondary antibody 
conjugated with horseradish peroxidase. The immunocomplexes were visualized 
with enhanced chemiluminescence kits (GE Healthcare). 

Once protein silencing was assessed, cells were infected with 0.5-0.75 µg ml−1 
of p24 of viral clone NL4-3/E-R- as described. At 24 and 48 h after infection, samples 
were collected for further analysis. 
Integration assay (Alu PCR). Infected cells were tested for integration of HIV-1 
by isolating genomic DNA from 1 X 10° cells with a DNeasy Tissue Kit (Qiagen). 
Genomic DNA (100 ng) was subjected to quantitative Alu-LTR PCR for integrated 
provirus or for 2-LTR circles as previously described. 


Quantitative reverse transcription PCR. For the quantification of HIV transcript 
levels, RNA was purified from the cells with a Nucleospin RNA II purification kit 
(Macherey-Nagel). The messenger RNA (mRNA) levels were quantified by TaqMan 
quantitative reverse transcription PCR (qRT-PCR) using HIV-1 or interleukin-2 
(IL-2) primers and probes, with the housekeeping genes 18S and GAPDH as controls. 
Luciferase activity assay. Cells were harvested 48 h after infection, and luciferase 
activity was measured using the Luciferase Assay Kit (Promega). Viral expression 
was reported after normalization over micrograms of total cell extracts. 

Cell preparation for three-dimensional immuno-DNA FISH. Three-dimensional 
FISH combined with immunostaining was performed according to protocols in 
ref. 37. Culture or primary cells were resuspended at 3 X 10° cells per millilitre in 
5% FBS in PBS and the cell suspension was allowed to attach to glass cover slips 
previously coated with poly-L-lysine. Cells were fixed in 4% paraformaldehyde in 
0.3X hypotonic PBS for 10 min, permeabilized with PBS/0.5% Triton X-100 for 
10 min and left in PBS/20% glycerol for 1 h. Cells were then blocked in PBS/5% fetal 
horse serum (FHS) for 45 min, and primary antibody (anti-NPC mAb414, Covance; 
anti-LaminB, Abcam ab16048; anti-GFP, LifeTechnologies A6455) was added for 
an overnight incubation at +4 °C in a humid chamber. The subsequent day, cells 
were washed five times in PBS-T (PBS with 0.05% Tween) and the secondary antibody 
(Jackson Laboratories) was used for 45 min at 22 °C (1/1,000 dilution). After 
five washings in PBS-T, cells were additionally crosslinked with EGS (ethylene 
glycol-bis(succinic acid N-hydroxysuccinimide ester)) (Sigma E-3257) for 10 min, 
washed and permeabilized again in PBS-T/0.5% Triton X-100. After washing in 
PBS/0.05% Triton X-100, cells were rinsed and incubated in 0.1 N HCl (freshly pre- 
pared) for 10 min. Cells were left in PBS/20% glycerol for at least 45 min, and then 
subjected to five cycles of freeze and thaw in liquid nitrogen and PBS/20% glycerol. 
Additional washings in PBS/0.05% Triton X-100 preceded an overnight incubation 
in 50% formamide/2× SSC (hybridization buffer). The subsequent day, cells 
were treated with RNase A (100 µg ml−1 in 2× SSC) in a humid chamber at 37 °C 
for 1 h, were rinsed again in 50% formamide/2× SSC for at least 1 h (or overnight) 
and were then subjected to hybridization with the appropriate probe. 

Probes for hybridization in three-dimensional immuno-DNA FISH. For visu- 
alization of the loci of interest, specific BAC clones (selected from CHORI (Children’s 
Hospital Oakland Research Institute in Oakland, California) sites and purchased 
from Invitrogen) were isolated according to the manufacturer’s instructions. The 
listing of the BACs with their identities and the genes they contain is provided in 
the Supplementary Information. BAC DNA (2 µg) was labelled with digoxigenin 
by Dig-Nick Translation (Roche) at 15 °C. 

For the visualization of HIV-1, lentiviral or gammaretroviral vector DNA integrated 
inside Jurkat or primary CD4+ T cells, 2 µg of the respective plasmids was 
labelled by nick translation in the presence of biotin-16-dUTP nucleotides at 15 °C 
for 3 h. 

In both cases, probes were checked on agarose gel and then cleared by using an 
Illustra Microspin G-25 column (GE Healthcare) and precipitated in the presence 
of Cot-1 DNA (Roche) and DNA from herring sperm (Sigma). Finally, after ethanol 
precipitation, the probes were resuspended in 10 µl formamide, incubated at 37 °C 
for 15-20 min and 10 µl of 20% dextran in 4× SSC was added to a final volume of 
20 µl. 
Hybridization set up and development. The probe (1-10 µl) was loaded onto glass 
cover slips with the cells, followed by sealing with rubber cement, and heat-denatured 
on a heat block at 75 °C for 4 min. Hybridization was performed for 48 h at 37 °C 
in a humid chamber. Three washings in 2× SSC (10 min each) were followed by 
three washings in 0.5× SSC at 56 °C. 

FISH development for Dig-labelled BACs was performed by using FITC-labelled 
anti-digoxigenin antibody (Roche), whereas biotin-labelled HIV-1 probes were 
detected by a TSA Plus system from Perkin Elmer, allowing signal amplification, 
by using an anti-biotin antibody (SA-HRP) and a secondary antibody with a fluorescent 
dye (usually FITC for HIV). 
Microscopy. Three-dimensional stacks of slides with fixed cells were captured on 
a Zeiss LSM 510 META confocal microscope (Carl Zeiss Microimaging) with a 63× 
numerical aperture 1.4 Plan-Apochromat oil objective. The pinhole of the microscope 
was adjusted to obtain an optical slice of less than 1.0 µm for any wavelength 
acquired. 

Distances observed between the FISH signals and the nuclear envelope were 
measured using LSM 510 Image Examiner Software (Zeiss) and Volocity (Perkin 
Elmer); measurements were normalized over nuclear radius (defined as half of the 
middle of the mAb414-TRITC ring), and then binned into three classes of equal 
surface area. 
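
The binning of normalized distances into three zones of equal surface area can be sketched as follows. This is a minimal illustration in Python, not the software used in the study: it assumes a circular nuclear cross-section of unit radius and a distance from the nuclear envelope that has already been normalized over the nuclear radius. The function and constant names are hypothetical.

```python
import math
from collections import Counter

# Radial positions (fractions of the nuclear radius) that split a unit disc
# into three concentric zones of equal area: r^2 = 2/3 and r^2 = 1/3.
OUTER_BOUNDARY = math.sqrt(2.0 / 3.0)
INNER_BOUNDARY = math.sqrt(1.0 / 3.0)


def zone_for_distance(normalized_distance):
    """Return 1 (peripheral), 2 (intermediate) or 3 (central).

    normalized_distance is the distance of a FISH signal from the nuclear
    envelope divided by the nuclear radius (0 = envelope, 1 = centre).
    """
    if not 0.0 <= normalized_distance <= 1.0:
        raise ValueError("normalized distance must lie between 0 and 1")
    radial_position = 1.0 - normalized_distance
    if radial_position >= OUTER_BOUNDARY:
        return 1
    if radial_position >= INNER_BOUNDARY:
        return 2
    return 3


def zone_fractions(normalized_distances):
    """Fraction of signals that fall in each of the three zones."""
    counts = Counter(zone_for_distance(d) for d in normalized_distances)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("at least one measurement is required")
    return {zone: counts.get(zone, 0) / total for zone in (1, 2, 3)}
```

Under these assumptions zone 1 covers roughly the outer 18% of the radius, zone 2 the next 24% and zone 3 the remaining 58%, so loci distributed evenly over the nuclear section would place about one third of their signals in each zone.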

Measurements were acquired for the alleles of the following genes in activated 
CD4+ T cells: NPLOC4 (n = 90), FKBP5 (n = 128), NFATC3 (n = 68), HEATR7A 
(n = 78), RPTOR (n = 94), SPTAN (n = 72), SMG1 (n = 80), GRB2 (n = 132), KDM2A 
(n = 102), DNMT1 (n = 136), hotter zone 1 (n = 118), hotter zone 2 (n = 100), 
hotter zone 3 (n = 62), hotter zone 4 (n = 154), GAPDH (n = 52), ACTN1 (n = 126), 
CD4 (n = 100), CD28 (n = 52), HEATR6 (n = 44), KDM2B (n = 106), PACS2 
(n = 42), IKZF3 (n = 52), TAP2 (n = 54), CNTN4 (n = 56), GPC5 (n = 30), PTPRD 
(n = 30). Measurements were acquired for the alleles of the following genes in 
CD34+ cells: IKZF3 (n = 54) and TAP2 (n = 60). Measurements were acquired for 
the proviruses in primary macrophages (n = 18) and the U937 cell line (n = 30). 
Measurements were acquired for the proviruses in primary cells under several 
conditions: 4 days after infection of activated CD4+ T cells (n = 160 HIV-1NL4-3/E-R- 
and n = 42 HIV-1BRU measured in three independent experiments); HIV-1 in 
CD4+ T cells from infected patients (n = 28 and n = 27); 4 days after infection 
with mutant viruses IN(D64E) (n = 30) or IN(D116N) (n = 66), or cells infected 
with HIV-1NL4-3 upon raltegravir treatment (n = 159); latent CD4+ cells (2 weeks 
after infection) with or without CD3/CD28 stimulation (n = 40 and n = 33, respectively). 
Measurements were acquired for proviruses in Jurkat or J-Lat cell lines 
4 days after infection of Jurkat with HIV-1NL4-3/E-R- in different conditions: no 
transfection (control, n = 116), transfections with non-targeting siRNA (siNT2/NT5, 
n = 163), LEDGF/p75 siRNA (siLEDGF, n = 164), Nup153 siRNA (siNup153, n = 129), 
Nup153 siRNA + enhanced GFP (eGFP)-Nup153 (n = 52). The corresponding 
graphs show the average results from three independent experiments. Proviral 
DNA was analysed in the J-Lat clone 15.4 with and without TPA (n = 74 and 
n = 150, respectively); Jurkat + lentiviral promoter (n = 19); Jurkat + transcription-less 
lentiviral vector (n = 51); Jurkat + gammaretroviral vector (n = 88). 

ChIP. CD4* T cells (40 X 10°) were washed twice in PBS before crosslinking with 
1% final formaldehyde for 10 min at room temperature, followed by termination 
of the reaction with 125 mM glycine on ice. The cell pellet was washed twice with PBS 
and was lysed in 0.5% NP-40 buffer (10 mM Tris-Cl pH 7.4, 10 mM NaCl, 3 mM 
MgCl2, 1 mM PMSF and protease inhibitors). The nuclei obtained were washed 
once in the same buffer without NP-40. Lysis of the nuclei was performed using the 
same buffer containing 4% of NP-40 at 37 °C for 15 min, upon which micrococcal 
nuclease was added (120 units of the enzyme), and the reaction was stopped with 
3 mM EGTA. DNA was additionally sheared by sonication to an average 
fragment size below 500 base pairs. Extracts were pre-cleared by two rounds of 
incubation with immunoglobulin-γ and agarose beads, followed by centrifugation 
at 1,200g for 5-10 min. The lysate (400 µl) was then incubated with 2-4 µg of the indicated 
antibody overnight at 4 °C, followed by incubation for 4 h with MagnaChIP 
Protein A/G Magnetic Beads (Millipore). Beads were then washed thoroughly with 
RIPA150, with LiCl-containing buffer and with TE, RNase treated for at least 30 min 
at 37 °C, and treated with proteinase K for at least 2 h at 56 °C. De-crosslinking of 
protein-DNA complexes was performed by an overnight incubation at 65 °C. DNA 
was then extracted by phenol-chloroform extraction followed by ethanol precipitation 
and was quantified by real-time PCR. The following antibodies were used 
in the ChIP experiments: mAb414 (Covance, MMS-120R), anti-Pol2 (Santa Cruz, 
sc-9001X), anti-USF1 (Santa Cruz, sc-229X), anti-NF-κB p65 subunit (Santa Cruz, 
sc-109X), anti-Nup153 [SA1] (Abcam, ab-96462) and anti-Nup153 (QE5) (Abcam, 
ab-24700), anti-Nup98 (Cell Signaling, 2598), anti-Nup62 (BD Biosciences, 610497), 
anti-Tpr (Abcam ab-58344), anti-Mcm2 (Abcam), mouse IgG (Santa Cruz, sc-2025). 
The graphs of ChIP figures show the mean and s.e.m. from at least three independent 
experiments. 

Bioinformatics and statistical analysis. No statistical methods were used to pre- 
determine sample size. 

Five lists of HIV-1 integration sites were collected from published work (refs 2, 3, 38-40) 
and an unpublished list of integration sites in CD4+ T cells provided by A.R. and 
F.M. (Extended Data Table 1). HIV-1 RIGs (n = 156) were genes found in more 
than one list (Supplementary Information and Extended Data Table 1); their genomic 
position was plotted onto the chromosome map using the Idiographica webtool 
(http://www.ncrna.org/idiographica (ref. 9); Extended Data Fig. 2). Genomic coordinates 
of eight selected hotter zones, in which HIV-1 integration density was 
found to be higher than expected, were downloaded from the Bushman Lab website 
(http://www.bushmanlab.org/tutorials/ucsc; ref. 1) and reported in the Supplementary 
Information. 

The calculation of the probability of finding 156 genes present in more than one 
list by chance was performed by computer simulation. A program was written to 
randomly draw, from 25,000 genes, 265, 329, 294, 32, 158 and 58 genes, and to count 
the genes drawn more than once. The simulation was repeated 1 X 10° times. The 
distribution obtained is shown in Extended Data Fig. 1. Calculations were per- 
formed using the Matlab 2011R software (http://www.mathworks.com/). 
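
The simulation described above can be sketched in Python (the original calculation was run in Matlab). This is an illustrative reconstruction rather than the authors' program: it assumes that each list is drawn without replacement and independently of the other lists. The function names, the seed and the empirical P-value helper are hypothetical, and the iteration count is left to the caller so that a small run finishes quickly.

```python
import random
from collections import Counter

GENOME_SIZE = 25_000                       # number of genes to draw from
LIST_SIZES = (265, 329, 294, 32, 158, 58)  # sizes of the six integration lists


def count_recurrent_genes(rng, genome_size=GENOME_SIZE, list_sizes=LIST_SIZES):
    """Draw one random gene list per size and count the genes that
    appear in more than one list."""
    occurrences = Counter()
    for size in list_sizes:
        # Sampling without replacement: a gene appears once per list at most.
        occurrences.update(rng.sample(range(genome_size), size))
    return sum(1 for n_lists in occurrences.values() if n_lists > 1)


def simulate(n_iterations, seed=0):
    """Repeat the random drawing and return the recurrent-gene counts."""
    rng = random.Random(seed)
    return [count_recurrent_genes(rng) for _ in range(n_iterations)]


def empirical_p_value(simulated_counts, observed_count):
    """Upper bound on P: share of simulations reaching the observed count,
    with a +1 correction so the estimate is never exactly zero."""
    reached = sum(1 for count in simulated_counts if count >= observed_count)
    return (reached + 1) / (len(simulated_counts) + 1)
```

With these list sizes the expected number of genes shared by chance is close to 20, which agrees with the peak of the distribution reported in Extended Data Fig. 1; calling `empirical_p_value(simulate(n), 156)` then bounds the probability of observing 156 recurrent genes by roughly 1/(n + 1).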

Expression of HIV RIGs and control genes was derived from published transcriptomic 
data in CD4+ T cells, using biogps.org (as in refs 12, 13, 22), and was compared 
with a random set of genes. Using the non-parametric Mann-Whitney-Wilcoxon 
test, it was concluded that HIV RIGs and control genes are more 
transcribed than a random set of genes (P < 0.005). 

LAD coordinates were obtained from ref. 5, whereas genes inside LADs were 
derived from BioMart Ensembl and named LAD genes. Then, the P value of the 
common genes in the LAD genes, HIV RIGs, the six integration lists and cold genes 
were calculated by pairwise comparison of each combination, followed by hyper- 
geometric test (Fig. 3g). The following P-values were obtained: ref. 3, P= 1.27 x 10°; 
ref. 2, P= 0.008; ref. 38, P = 0.001; ref. 40, P = 0.36; ref. 39, P = 0.0007; F.M. et al. 
(unpublished observations), P = 1.32 X 107s ref. 1 hotter zones, P = 0.0003; HIV 
RIGs, P = 2.09 X 10 '°. Eighty per cent of genes that are never targeted by HIV-1 
(cold genes) are significantly enriched inside LADs (P = 3.25 X 101). 
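
A hypergeometric test of this kind can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the code used for Fig. 3g: the gene universe, the number of LAD genes and the overlap are supplied by the caller, and the function names are hypothetical.

```python
from math import comb


def hypergeom_pmf(k, population, successes, draws):
    """P(X = k) when `draws` genes are sampled without replacement from
    `population` genes, of which `successes` belong to the category of
    interest (for example, genes located inside LADs)."""
    return (comb(successes, k) * comb(population - successes, draws - k)
            / comb(population, draws))


def depletion_p_value(observed, population, successes, draws):
    """Lower tail, P(X <= observed): the overlap is smaller than expected."""
    return sum(hypergeom_pmf(k, population, successes, draws)
               for k in range(0, observed + 1))


def enrichment_p_value(observed, population, successes, draws):
    """Upper tail, P(X >= observed): the overlap is larger than expected."""
    largest_overlap = min(successes, draws)
    return sum(hypergeom_pmf(k, population, successes, draws)
               for k in range(observed, largest_overlap + 1))
```

As a usage example with partly hypothetical numbers, `depletion_p_value(15, 25_000, 4_000, 156)` asks how likely it is that 15 or fewer of 156 genes fall inside LADs when about 4,000 of 25,000 genes are LAD genes; the overlap of 15 is invented for illustration and is not a value reported here.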

The profile of aligned LAD border regions (Fig. 3i) was performed as described 
in ref. 5. A χ² test was applied to compare the distribution across the LAD border of 
HIV RIGs with that of 3,000 random genes that were generated without replacement 
using the RSA-tool (http://floresta.eead.csic.es/rsat/random-genes_form.cgi). 
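
The comparison can be illustrated with a simplified χ² test in Python. This sketch collapses the distribution across the LAD border into two classes (outside or inside LADs), which is an assumption made for brevity and may differ from the binning used for Fig. 3i. The counts are back-calculated from the percentages given in the main text (87.2% of 156 RIGs and 68.2% of 3,000 random genes outside LADs) and are therefore approximate; no continuity correction is applied.

```python
import math


def chi_square_2x2(table):
    """Pearson chi-square statistic and P value (1 degree of freedom)
    for a 2 x 2 contingency table given as ((a, b), (c, d))."""
    (a, b), (c, d) = table
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    grand_total = a + b + c + d
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    # With 1 degree of freedom the chi-square survival function
    # reduces to erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


# Assumed counts, derived from the reported percentages:
rigs = (136, 20)            # (outside LADs, inside LADs) for 156 RIGs
random_genes = (2046, 954)  # same split for 3,000 random genes
statistic, p_value = chi_square_2x2((rigs, random_genes))
```

Under these assumed counts the P value falls below 0.001, in line with the significance level reported for Fig. 3i.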

ChIP-seq profile analyses (Fig. 3b-f and Extended Data Fig. 6) were performed 
as in refs 22, 42. The 1,000 most expressed and 1,000 least expressed genes were 
obtained as in ref. 22 and named active and silent genes, respectively. The TSS coordinates 
of these genes were obtained using the University of California, Santa Cruz 
(UCSC) Table Browser. 

Comparison between groups for expression data was performed using a non-parametric 
Mann-Whitney-Wilcoxon rank sum test; comparison of gene distributions 
was by a χ² test, with the exception of data shown in Fig. 3g, which were 
analysed by a hypergeometric test. For the FISH, ChIP and real-time PCR results, 
the reported values are means and s.e.m., calculated from at least three independent 
samples. For statistical comparison of three or more groups, one-way analysis of 
variance followed by Tukey’s post-hoc test was used. A value of P < 0.05 was considered 
significant. 

Extended Data Figure 1 | HIV-1 RIGs. a, Probability of recurrence of a 
random set of genes in different lists of HIV-1 integration sites. The histogram 
shows the distribution of the number of genes present at least twice in the six 
HIV-1 integration site lists considered; 1 X 10° independent drawings were 
evaluated. The distribution peaks around 20 genes, with a maximum observed 
of 50. The number of RIGs detected experimentally in at least two lists was 
instead 156 (P< 1X 10°). b, Human chromosome map showing the 
localization of 156 HIV RIGs. Genes found in four, three and two HIV-1 
integration lists are highlighted in red, orange and black, respectively. Hotter 
genomic regions, favoured for HIV-1 integration as described in ref. 1, are 
highlighted in blue and indicated by a star. 




[Extended Data Figure 2 schematic. Left panel, RIG loci: 6p21.31, 9q34.11, 
11q13.2, 16p12.3, 16q22.1, 17q25.1-3, 19p13.2 and 8q24.3. Right panel, hotter 
zones: 4q25, 8q24.3, 19q13.42 and 8q24.21. For each locus, the numbers of RIGs, 
other integration sites and their total are given within 1, 5 and 10 Mb.] 

Extended Data Figure 2 | Distribution of RIGs or individual integration 
sites all over the loci analysed by FISH. The scheme describes the distribution 
of RIGs (bold) or simple integration sites (regular) around the locus analysed by 
FISH in Fig. 1; RIGs are in the left panel and hotter zones are in the right 
panel. As indicated on the side, the total number of RIGs/integrants was 
calculated within 1, 5 or 10 Mb from the locus analysed by FISH. In total, 
considering all the RIGs and hotter zones, there are 44 other RIGs/integrants 
within a window of 1 Mb, 116 within 5 Mb and 169 within 10 Mb around the 
analysed locus. 
Extended Data Figure 3 | FISH analysis. a, Representative images of 
three-dimensional immuno-DNA FISH of HIV-1 DNA (green) in HIV-1-infected 
human primary macrophages stained with mAb414 (red), with relative 
distribution of FISH signals according to the three concentric zones. b, FISH 
of HIV-1 DNA (green) in the HIV-1 infected U937 monocytic cell line. 
c, Representative images of three-dimensional immuno-DNA FISH of 
lentiviral vector pLV-THM (green) in Jurkat cells. d, Representative images of 
three-dimensional immuno-DNA FISH of the promoter-less lentiviral vector 
pCCL-18GFP (green) in Jurkat cells. e, Representative images of three-dimensional 
immuno-DNA FISH of a gammaretroviral vector (green) in Jurkat 
cells. For all panels, the graphs are organized as described in the main text. 
Extended Data Figure 4 | Reconstitution of Nup153 by transfection of an 
siRNA-resistant plasmid coding for eGFP-Nup153. a, Scheme of the 
experiment performed in Jurkat cells. eGFP-Nup153* contains the coding 
region for Nup153 tagged with eGFP, but is devoid of the 3′ untranslated region 
of the mRNA, which is the target of the anti-Nup153 siRNA4. b, Western 
blot showing Nup153 protein level at the moment of infection. siNT, non-targeting 
siRNA. c, Real-time Alu-PCR in Jurkat cells 2 days after infection with 
HIV-1NL4-3. Values are mean and s.e.m. of three experiments after 
normalization over Jurkat transfected with a control, non-targeting siRNA 
(siNT). d, Representative images of three-dimensional immuno-DNA FISH of 
HIV-1 DNA (red) in Jurkat cells transfected first with the eGFP-Nup153* 
expression plasmid and then with the siRNA4, targeting endogenous Nup153. 
The graph on the right side shows the distribution of HIV-1 FISH signals 
according to the three concentric zones in cells expressing eGFP. 

Extended Data Figure 5 | ChIP-seq profiles for HIV RIGs, cold genes and
controls. a–e, Profiles of chromatin modifications around the TSS for HIV
[…] silent (blue) genes in activated CD4+ T cells. Each panel reports results for a
specific modification, as indicated.

Extended Data Figure 6 | Association of HIV-1 provirus with nucleoporins.
a, Positions of primers used for ChIP on the HIV-LTR (numbering is according
to the TSS and nucleosomes are shown), the NPLOC4 RIG, the PTPRD
cold gene, B48 and B13 genomic controls for DNA standardization. B48 maps
within the human lamin B2 origin of DNA replication. b, Control ChIP data in
CD4+ T cells infected with HIV-1 NL4-3/E-R-, using total immunoglobulin-γ
and an antibody against the unrelated Mcm2 cellular protein. For each analysed
region, the amount of immunoprecipitated chromatin using the indicated 
antibodies was normalized according to the input amount of chromatin. Mean 
and s.e.m. from at least three independent experiments. **P < 0.01. c, ChIP 
results in CD4+ T cells, 4 days after HIV-1 infection, using the indicated
antibodies. The amount of immunoprecipitated chromatin was normalized
according to input. Mean and s.e.m. from at least three independent
experiments. **P < 0.01, *P < 0.05. d, ChIP results in CD4+ T cells, 4 days
after infection with wild-type HIV-1 or the IN(D64E) mutant virus. For
the PPR1 region, corresponding to the viral promoter, the amount of
immunoprecipitated chromatin using the indicated antibodies (mAb414,
Nup153 and Pol2) was calculated according to the input amount of chromatin,
and then normalized over the B13 control genomic region. The graphs show
the mean and s.e.m. from three independent experiments. ***P < 0.001;
*P < 0.05.
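
The two-step normalization described in panel d (signal relative to input, then relative to the B13 control region) can be sketched as follows. This is a minimal illustration only: the function names and all numerical values are hypothetical and are not taken from the study.

```python
def percent_input(ip_signal: float, input_signal: float) -> float:
    """Immunoprecipitated chromatin expressed as a percentage of input chromatin."""
    if input_signal <= 0:
        raise ValueError("input signal must be positive")
    return 100.0 * ip_signal / input_signal


def normalize_over_control(region_pct: float, control_pct: float) -> float:
    """Express a region's percent-input value relative to a control region."""
    if control_pct <= 0:
        raise ValueError("control value must be positive")
    return region_pct / control_pct


# Hypothetical qPCR quantities (arbitrary units), for illustration only.
ppr1_pct = percent_input(ip_signal=4.0, input_signal=200.0)  # viral promoter region
b13_pct = percent_input(ip_signal=0.5, input_signal=250.0)   # B13 control region
ppr1_over_b13 = normalize_over_control(ppr1_pct, b13_pct)
print(ppr1_pct, b13_pct, ppr1_over_b13)
```

A value above 1 in the final ratio would indicate more immunoprecipitated chromatin at the tested region than at the control region, under these assumed inputs.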

Extended Data Figure 7 | HIV-1 transcriptional activation is concomitant
with, and requires, nucleoporins binding to the provirus. a, Quantitative 
reverse transcription PCR measurement of HIV-1 mRNA in mock- or 
TPA-treated J-Lat 15.4 cells. b, Three-dimensional immuno-DNA FISH of 
HIV-1 DNA (green) in J-Lat 15.4 cells stained for NPC (red) before and 
after TPA reactivation. c, Scheme of the experiment for the generation of a 
primary, cellular model of HIV-1 latency to study HIV-1 DNA localization 
in activated and resting primary CD4+ T cells. d, Quantitative reverse
transcription PCR measurement of HIV-1 mRNA levels in primary infected
CD4+ T cells before and after reactivation, normalized over the 18S
housekeeping gene. Latent versus reactivated: P < 0.001. e, Three-dimensional
immuno-DNA FISH of HIV-1 DNA (green) in latently infected CD4+ T cells
stained for the NPC (red) before and after reactivation, with relative
distribution of HIV-1 FISH signals according to the three concentric zones
considered in this work. f, ChIP in control and TPA-stimulated J-Lat 15.4 cells,
with the indicated antibodies. Mean and s.e.m. from at least three independent
experiments. g, Immunoblot for Tpr (upper panel) and Nup153 (lower
panel), 36 h after transfection of the indicated siRNAs (NT, non-targeting
control). h, Levels of HIV-1 RNA in siRNA-treated J-Lat 15.4 cells after TPA 
activation. Mean and s.e.m. from three independent experiments. *P < 0.05. 

Extended Data Figure 8 | Silencing of Tpr in HeLa cells and 15.4 J-Lat
clones. a, Scheme of the experiment to study HIV-1 integration in infected 
HeLa cells after Tpr silencing. b, Western blot showing Tpr protein level at the 
moment of infection, after treatment of HeLa cells with a non-targeting 
siRNA (siNT) or an siRNA targeting Tpr at two different doses. Values are 
mean and s.e.m. of three experiments after normalization over HeLa cells 
transfected with a control non-targeting siRNA. c, Real-time Alu PCR in HeLa
cells infected with HIV-1 NL4-3 and previously transfected with a non-targeting
siRNA (siNT) or an siRNA targeting Tpr at two different doses. Values are
mean and s.e.m. of three experiments after normalization over HeLa cells
transfected with a control non-targeting siRNA. d, Luciferase activity assay in
HeLa cells infected with HIV-1 NL4-3 and previously transfected with a non-targeting
siRNA (siNT) or an siRNA targeting Tpr at two different doses. Values are
mean and s.e.m. of three experiments. Statistical significance: ***P < 0.001;
**P < 0.01. e, Real-time PCR quantification of IL-2 mRNA levels in J-Lat
15.4 cells. The following conditions were tested: untreated cells, plus TPA (4 h),
transfection with non-targeting siRNA or an siRNA targeting Tpr for 24 h,
followed by treatment with TPA (4 h). Values are mean and s.e.m. of three
experiments after normalization over GAPDH. Transcription of interleukin-2
(IL-2) was not significantly altered upon Tpr downregulation. 

Extended Data Figure 9 | Model for HIV-1 integration site selection. After entry into the nucleus through the nuclear pore, the viral DNA integrates into the
active chromatin closest to the NPC (green zones), avoiding both LADs and the inner part of the nucleus (red zones). 

Extended Data Table 1 | List of HIV-1 integration sites considered in this work

List             Source                                               Nr. Published   Nr. Unique          Reference
                                                                      Sequences       Intragenic Sites
Brady et al.     Primary, activated CD4+ T cells, in vitro infection  524             265                 (Brady et al., 2009)
Mavilio et al.   Primary, activated CD4+ T cells, in vitro infection  638             329                 Unpublished
Schroder et al.  Sup T1, in vitro infection                           642             294                 (Schroder et al., 2002)
Liu et al.       [illegible] tissues from HIV patients                42              32                  (Liu et al., 2006)
Ikeda et al.     [illegible]+ T cells [illegible] patients            463             158                 (Ikeda et al., 2007)
Han et al.       [illegible] T cells [illegible] patients             74              58                  (Han et al., 2004)
TOTAL                                                                                 1136

N. Genes in 4 lists              6
N. Genes in 3 lists              24
N. Genes in 2 lists              126
TOTAL N. RECURRENT GENES         156

Out of the indicated numbers of sequences identified by the six considered studies, 1,136 were within individual genes; of these, 156 recurred in two or more studies.
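
As a quick consistency check on the totals reported in the table, the per-list counts can be summed directly. The sketch below uses only the numbers printed in the table; the dictionary keys follow the cited references and the variable names are illustrative.

```python
# Unique intragenic integration sites per list, as printed in Extended Data Table 1.
unique_intragenic_sites = {
    "Brady et al., 2009": 265,
    "Mavilio et al., unpublished": 329,
    "Schroder et al., 2002": 294,
    "Liu et al., 2006": 32,
    "Ikeda et al., 2007": 158,
    "Han et al., 2004": 58,
}

# Number of genes recurring in exactly 4, 3 or 2 of the lists.
genes_by_number_of_lists = {4: 6, 3: 24, 2: 126}

total_sites = sum(unique_intragenic_sites.values())
total_recurrent_genes = sum(genes_by_number_of_lists.values())
print(total_sites, total_recurrent_genes)
```

Both sums agree with the TOTAL rows of the table (1,136 sites and 156 recurrent genes).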

The Xist lncRNA interacts directly with SHARP to
silence transcription through HDAC3


Colleen A. McHugh*, Chun-Kan Chen*, Amy Chow, Christine F. Surka, Christina Tran, Patrick McDonel,
Amy Pandya-Jones, Mario Blanco, Christina Burghard, Annie Moradian, Michael J. Sweredoski, Alexander A. Shishkin,
Julia Su, Eric S. Lander, Sonja Hess, Kathrin Plath & Mitchell Guttman


Many long non-coding RNAs (lncRNAs) affect gene expression,
but the mechanisms by which they act are still largely unknown.
One of the best-studied lncRNAs is Xist, which is required for
transcriptional silencing of one X chromosome during development
in female mammals. Despite extensive efforts to define
the mechanism of Xist-mediated transcriptional silencing, we still
do not know any proteins required for this role. The main challenge
is that there are currently no methods to comprehensively
define the proteins that directly interact with a ncRNA in the cell.
Here we develop a method to purify a lncRNA from cells and
identify proteins interacting with it directly using quantitative
mass spectrometry. We identify ten proteins that specifically associate
with Xist; three of these proteins (SHARP, SAF-A and LBR)
are required for Xist-mediated transcriptional silencing. We show
that SHARP, which interacts with the SMRT co-repressor that
activates HDAC3, is not only essential for silencing, but is also
required for the exclusion of RNA polymerase II (Pol II) from the
inactive X. Both SMRT and HDAC3 are also required for silencing
and Pol II exclusion. In addition to silencing transcription,
SHARP and HDAC3 are required for Xist-mediated recruitment
of the polycomb repressive complex 2 (PRC2) across the X chromosome.
Our results suggest that Xist silences transcription by
directly interacting with SHARP, recruiting SMRT, activating
HDAC3, and deacetylating histones to exclude Pol II across the
X chromosome.

Over the last two decades, numerous attempts have been made to
define the protein complexes that interact with Xist and that are
required for its various roles in X-chromosome inactivation (XCI).
Most studies have used prior knowledge of the molecular events that
occur on the X chromosome to define potential Xist-interacting proteins.
Although individual proteins that associate with Xist have been
identified, we still do not know any of the proteins required for
Xist-mediated transcriptional silencing because perturbations of these
proteins, including components of the PRC2 complex, have no effect on
Xist-mediated transcriptional silencing. Current methods for identifying
lncRNA-interacting proteins either require selection of specific
candidate interacting proteins or fail to distinguish direct
RNA interactions that occur in the cell from those that merely associate
in solution (reviewed in ref. 5).

To develop a method for identifying the proteins that directly interact
with a specific lncRNA in vivo, we adapted our RNA antisense
purification (RAP) method to purify a ncRNA complex and identify
the interacting proteins by quantitative mass spectrometry (RAP-MS)
(Methods, Fig. 1a). Briefly, RAP-MS uses ultraviolet (UV) crosslinking
to create covalent bonds between directly interacting
RNA and protein and purifies lncRNAs in denaturing conditions to
disrupt non-covalent interactions (Methods). This UV-crosslinking
and denaturing approach, which is used by methods such as crosslinking
and immunoprecipitation (CLIP), is known to identify only
direct RNA-protein interactions and to separate interactions that are
crosslinked in the cell from those that merely associate in solution.

Adapting this UV-crosslinking and denaturing approach to enable
purification of a specific lncRNA is challenging for several reasons. (1)
To purify lncRNA complexes in denaturing conditions, we need an
RNA capture method that can withstand harsh denaturing conditions.
(2) To detect the proteins associated with a given lncRNA, we need to
achieve high purification yields of a lncRNA complex because proteins,
unlike nucleic acids, cannot be amplified before detection. (3) Because
any individual RNA is likely to be present at a very low percentage of
the total cellular RNA, we need to achieve high levels of enrichment to
identify specific interacting proteins. (4) Because the number of background
proteins will be high, even after enrichment, we need accurate
and sensitive methods for protein quantification to detect specific
lncRNA-interacting proteins.

The RAP-MS method addresses these challenges because (1)
RAP uses long biotinylated antisense probes, which form very stable
RNA-DNA hybrids, and therefore can be used to purify lncRNA complexes
in denaturing and reducing conditions (that is, 4 M urea at 67 °C,
see Methods). (2) We optimized the RAP method to achieve high yields
of endogenous RNA complexes. In our original protocol, we achieved
<2% yield of the endogenous RNA complex; by optimizing hybridization,
washing, and elution conditions (Methods), we were able to reproducibly
achieve ~70% yield (Extended Data Fig. 1a, Methods). (3)
Using our optimized conditions, we increased the enrichment levels
for the target ncRNA complex (~5,000-fold, Extended Data Fig. 1b)
relative to our already high levels of enrichment achieved previously
(~100-fold). (4) To achieve sensitive quantification and to distinguish
between specific proteins and background proteins, we used stable
isotope labelling by amino acids in culture (SILAC) to label proteins
(Methods, Extended Data Fig. 1c), which enables quantitative comparisons
of purified proteins by mass spectrometry.

We validated the RAP-MS approach by defining the proteins that 
interact with two well-characterized non-coding RNAs: U1 (a core
component of the spliceosome) and 18S (a component of the small 
ribosomal subunit). In the U1 purifications, we identified 9 enriched 
proteins, all of which are known to interact with U1 (Supplementary 
Note 1). In the 18S purification, we identified 105 enriched proteins; 98 
of these (93%) were previously characterized as ribosomal proteins, 
ribosomal processing and assembly factors, translational regulators, or 
other known ribosome interactors (Extended Data Fig. 2). In particular, 
we identified 21 of the 31 known small ribosomal subunit proteins. The 
few missing proteins appear to fall predominantly into two categories:
proteins that make few direct contacts with the RNA and small proteins 
that contain few peptides that could be detected by mass spectrometry. 
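
The recovery figures quoted in this validation can be reproduced with simple arithmetic. The sketch below uses only the counts stated above; the helper name is illustrative.

```python
def fraction(part: int, whole: int) -> float:
    """Return part/whole as a fraction, guarding against an empty denominator."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return part / whole


# Counts stated in the text.
known_among_18s_enriched = fraction(98, 105)   # enriched proteins already known as ribosome-associated
small_subunit_recovered = fraction(21, 31)     # known small-subunit proteins that were identified

print(f"{known_among_18s_enriched:.0%} of 18S-enriched proteins were previously characterized")
print(f"{small_subunit_recovered:.0%} of known small ribosomal subunit proteins were identified")
```

The first ratio rounds to the 93% reported in the text; the second shows that roughly two thirds of the known small-subunit proteins were recovered.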


1Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125, USA. 2Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02139, USA.
3Department of Biological Chemistry, Jonsson Comprehensive Cancer Center, Molecular Biology Institute, University of California Los Angeles, Los Angeles, California 90095, USA. 4Eli and Edythe Broad
Center of Regenerative Medicine and Stem Cell Research, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California 90095, USA. 5Proteome Exploration Laboratory,
Beckman Institute, California Institute of Technology, Pasadena, California 91125, USA.
*These authors contributed equally to this work.


232 | NATURE | VOL 521 | 14 MAY 2015 


©2015 Macmillan Publishers Limited. All rights reserved 


[Figure 1 panel residue: panel a shows light and heavy medium, crosslinking, probe hybridization and denaturing steps; panel b lists SHARP, RBM15, MYEF2 and CELF1; remaining values are not recoverable from the extraction.]

Figure 1 | RAP-MS identifies direct Xist-interacting proteins. a, A schematic overview of […]

These results demonstrate that the RAP-MS method identifies
the majority of known RNA-interacting proteins, and that the 
proteins identified by RAP-MS are highly specific for the purified 
ncRNA complex. 

To define the proteins that interact with Xist during the initiation of 
XCI, we UV-crosslinked SILAC-labelled mouse embryonic stem (ES) 
cells after Xist induction and purified Xist from nuclear extracts
(Methods). To control for background proteins or non-specific pro- 
teins that might interact with any nuclear RNA, we separately purified 
the abundant U1 small nuclear RNA, which is not expected to interact 
with the same proteins as Xist. We identified the proteins in each 
sample using liquid chromatography-mass spectrometry and calcu- 
lated a SILAC ratio for each protein based on the intensity of all 
heavy or light peptides originating from the Xist or U1 purification 
(Fig. 1a, Methods). 

We identified 10 proteins that were enriched for Xist relative to U1 
(SILAC ratio >threefold, Fig. 1b). All 10 proteins were reproducibly 
enriched in multiple Xist purifications from independent biological 
samples (Methods). Consistent with the notion that these proteins 
are direct Xist-interacting proteins, 9 proteins contain well-character- 
ized RNA binding domains (Fig. 1c). 
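
The enrichment criterion used here (a SILAC ratio above threefold for Xist relative to U1) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the `Peptide` record, the function names, the protein label `BackgroundProteinA` and all intensities are hypothetical, and the ratio is taken as the summed peptide intensity from the Xist purification divided by that from the U1 purification.

```python
from dataclasses import dataclass


@dataclass
class Peptide:
    protein: str
    xist_intensity: float  # intensity in the label channel of the Xist purification
    u1_intensity: float    # intensity in the label channel of the U1 purification


def silac_ratios(peptides: list[Peptide]) -> dict[str, float]:
    """Sum peptide intensities per protein and return the Xist/U1 ratio."""
    totals: dict[str, tuple[float, float]] = {}
    for p in peptides:
        xist_sum, u1_sum = totals.get(p.protein, (0.0, 0.0))
        totals[p.protein] = (xist_sum + p.xist_intensity, u1_sum + p.u1_intensity)
    # Proteins with no U1 signal are skipped here to avoid division by zero.
    return {prot: x / u for prot, (x, u) in totals.items() if u > 0}


def enriched(ratios: dict[str, float], threshold: float = 3.0) -> list[str]:
    """Proteins whose ratio exceeds the threshold, sorted by name."""
    return sorted(prot for prot, ratio in ratios.items() if ratio > threshold)


# Hypothetical peptide intensities, for illustration only.
peptides = [
    Peptide("SHARP", 900.0, 100.0),
    Peptide("SHARP", 600.0, 50.0),
    Peptide("BackgroundProteinA", 200.0, 180.0),
    Peptide("BackgroundProteinA", 100.0, 120.0),
]
ratios = silac_ratios(peptides)
hits = enriched(ratios)
print(ratios, hits)
```

In a real analysis, proteins detected only in the Xist sample would need separate handling rather than being dropped, and reproducibility across independent purifications would be checked as described in the text.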

The identified Xist-interacting proteins are SHARP, RBM15,
MYEF2, CELF1, HNRNPC, LBR, SAF-A, RALY, HNRNPM, and
PTBP1 (Fig. 1b). SAF-A (scaffold attachment factor-A, also known
as HNRNPU) was previously shown to interact directly with Xist and
is required for tethering Xist to the inactive X chromosome in differentiated
cells. In addition, 5 of these proteins have been previously
implicated in transcriptional repression, chromatin regulation, and
nuclear organization. These include SHARP (SMRT and HDAC
associated repressor protein, also known as SPEN), a member of the
SPEN family of transcriptional repressors, which directly interacts
with the SMRT component (also known as NCOR2) of the nuclear
co-repressor complex that is known to interact with and activate
HDAC3 deacetylation activity on chromatin (Fig. 1c). Interestingly,
we also identified RBM15, another member of the SPEN family of
transcriptional repressors, which shares the same domain structure
as SHARP, but seems to have a distinct functional role during development.
MYEF2 has been shown to function as a negative regulator of
transcription in multiple cell types, although its mechanism of regulation
is still unknown. HNRNPM is a paralogue of MYEF2. Finally,
we identified LBR (lamin B receptor), a protein that is anchored in the
inner nuclear membrane and interacts with repressive chromatin regulatory
proteins and lamin B (Fig. 1c).

We confirmed the specificity of the identified Xist-interacting pro- 
teins (Supplementary Note 2, Methods). To ensure that they were not 
identified owing to non-specific RNA or protein capture, we per- 
formed RAP in uninduced cells (no Xist) and identified no enriched 
proteins. To ensure that these proteins are crosslinked with Xist in cells 
and not merely associating in solution, we performed RAP in cells that 
were not crosslinked (no UV) and identified no enriched proteins. To 
ensure that these proteins do not merely interact with any nuclear- 
enriched long ncRNA, we compared the Xist-purified proteins to those 
purified with 45S (pre-ribosomal RNA) and found that all 10 Xist- 
interacting proteins were still enriched. Finally, to validate these interactions
independently, we obtained high-quality affinity reagents for 8
of the 10 proteins (PTBP1, HNRNPC, CELF1, MYEF2, RBM15, LBR, 
RALY and SHARP), and immunoprecipitated the identified proteins 
in UV-crosslinked lysates. In all cases, we observed a strong enrich- 
ment for the Xist RNA (>fourfold), but not control mRNAs or 
lncRNAs (Extended Data Fig. 3, Supplementary Table 1).

Together, these results identify a set of highly specific and repro- 
ducible proteins that interact directly with Xist during the initiation of 
XCI. Given the generality of the RAP-MS approach, we expect that it 
will be broadly applicable for defining the proteins that interact 
directly with other lncRNAs.

To determine which proteins are required for Xist-mediated tran- 
scriptional silencing, we knocked down each of the proteins identified 
and assayed for the failure to silence gene expression on the X chro- 
mosome upon induction of Xist expression (Fig. 2a). 

Specifically, we selected two X-linked genes, Gpc4 and Atrx, that are 
well expressed in the absence of Xist expression, but are normally 
silenced by 16 h of Xist induction in our doxycycline-inducible system
in male cells (Fig. 2b, Methods). We used short interfering RNAs
(siRNAs) to knock down the messenger RNA levels of each of the proteins
identified by RAP-MS along with several negative controls
(Methods, Supplementary Table 2). We ensured that each cell examined 
showed both successful depletion of the siRNA-targeted mRNA (>70% 
reduction) as well as induction of Xist expression using single-molecule 
RNA fluorescence in situ hybridization (FISH; Methods). Within each 
of these cells, we quantified the mRNA level of each of the two X-linked 
genes before Xist induction (−Dox) and after Xist induction (+Dox).

As a control, we transfected several non-targeting siRNAs
(Methods). In these negative controls, we observed the expected silencing
of the X-linked genes studied (Gpc4 transcript levels decreased
from an average of 20 copies (−Dox) to 2 copies (+Dox) per cell and
Atrx transcript levels decreased from 22 to 3 copies per cell; Fig. 2b, c).
Consistent with previous observations, we found no effect on X chromosome
gene silencing upon knockdown of EED, a required component
of PRC2 (Fig. 2b), or other proteins previously associated
with Xist that do not seem to be required for transcriptional silencing
(Extended Data Fig. 4, Methods). Similarly, knockdown of RBM15,
MYEF2, PTBP1, CELF1, HNRNPC, RALY or HNRNPM did not alter
gene silencing on the X chromosome (Fig. 2b, Extended Data Fig. 5).
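
The silencing readout in the negative controls can be expressed as the transcript level after Xist induction relative to the level before induction. The sketch below uses the average copy numbers quoted above; the function and variable names are illustrative.

```python
def relative_expression(copies_plus_dox: float, copies_minus_dox: float) -> float:
    """Transcript level after Xist induction as a fraction of the level before induction."""
    if copies_minus_dox <= 0:
        raise ValueError("pre-induction copy number must be positive")
    return copies_plus_dox / copies_minus_dox


# Average copies per cell in the negative controls, as stated in the text: (-Dox, +Dox).
control_copies = {"Gpc4": (20, 2), "Atrx": (22, 3)}

remaining = {
    gene: relative_expression(plus, minus)
    for gene, (minus, plus) in control_copies.items()
}
for gene, level in remaining.items():
    print(f"{gene}: {level:.0%} of pre-induction level remains")
```

A value near 1 would indicate a failure to silence, whereas the control values here correspond to roughly 10% (Gpc4) and 14% (Atrx) of the pre-induction level.
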
In contrast, knockdown of SHARP, LBR or SAF-A largely abolished 
the silencing of X chromosome genes following Xist induction (Fig. 2b, c, 
Supplementary Note 3, Extended Data Figs 5, 6). Indeed, the expression 
levels of the X chromosome genes studied did not significantly change 
following Xist expression (Fig. 2c, Extended Data Fig. 5). These same 
silencing defects were observed with several independent siRNAs 
(Extended Data Fig. 7). Importantly, we observed the same X chro- 
mosome silencing defects upon knockdown of SHARP, LBR or SAF-A 
in differentiating female ES cells (Extended Data Fig. 8, Methods). 

These results demonstrate that SHARP, LBR and SAF-A are
required for Xist-mediated transcriptional silencing of the X chromosome.
Although the remaining seven Xist-interacting proteins
showed no effect on X-chromosome gene silencing, they may still be
important for Xist function. Some may have redundant functions (for
example, MYEF2 and HNRNPM, which are known paralogues); in
some cases, the small amount of protein remaining after knockdown
may still be sufficient for Xist function; and some of these proteins
may be important for alternative Xist-mediated roles, such as the maintenance
of XCI, which would not be captured by this silencing assay.

Xist initiates XCI by spreading across the future inactive X chromosome,
excluding RNA polymerase II (Pol II), and repositioning
active genes into a transcriptionally silenced nuclear compartment.
All of these roles (localization, RNA Pol II exclusion
and repositioning) are required for proper silencing of transcription
during the initiation of XCI.

Figure 2 | SHARP, LBR and SAF-A are required
for Xist-mediated gene silencing. a, Screen for
Xist-mediated gene silencing for knockdown of
control (top), non-silencing proteins (middle), or
silencing proteins (bottom). b, Gpc4 mRNA levels
after induction of Xist (+Dox) normalized to Gpc4
levels before Xist induction (−Dox). Error bars,
standard error of the mean across 50 cells from one
experiment. siCtrl, scrambled siRNA control.
c, Images of individual cells for two X-linked
mRNAs, Gpc4 (green) and Atrx (red), and DAPI
(blue) after treatment with different siRNAs
(rows). The number of identified mRNAs is shown.
Scale bars, 5 µm.

Figure 3 | SHARP is required for exclusion of Pol II from the Xist-coated
territory. a, Xist (red), Pol II (green) and DAPI (blue) across different siRNA
conditions (rows). b, c, Quantification of fluorescence intensity of Pol II within
Xist territory normalized to control siRNA levels for male ES cells after 16 h of
doxycycline treatment (b) and female ES cells after 1 day of retinoic-acid-induced
differentiation (c). Error bars, standard error of the mean across 50
cells from one experiment. NS, not significant; ***P value < 0.005, ****P
value < 0.001 relative to siControl by unpaired two-sample t-test. ΔA, genetic
deletion of the A-repeat of Xist. Scale bars, 5 µm.




Consistent with previous observations that SAF-A is required for 
Xist localization to chromatin in differentiated cells, we observed a
diffuse Xist localization pattern in the nucleus upon knockdown of 
SAF-A (Extended Data Fig. 5). This suggests that SAF-A is required for 
transcriptional silencing by localizing Xist, and its silencing proteins, 
to the X chromosome during the initiation of XCI. 

To determine the proteins responsible for establishing the initial 
silenced compartment on the X chromosome, we explored whether 
SHARP or LBR are required for the exclusion of Pol II from the Xist- 
coated region. Specifically, we measured the co-localization of Xist and 
Pol II in single cells (Methods). In wild-type cells after 16 h of Xist
induction, we observed a depletion of Pol II over the Xist-coated territory
(Fig. 3a, Methods). We observed a similar exclusion of Pol II
from the Xist-coated region in the negative controls and upon knockdown
of EED or LBR (Fig. 3a, b). In contrast, upon knockdown of
SHARP, we observed higher levels of Pol II over the Xist-coated
territory relative to the control samples (Fig. 3b). We confirmed that 
SHARP, but not LBR or EED, is similarly required for Pol II exclusion 
in differentiating female ES cells (Fig. 3c, Extended Data Fig. 9, 
Supplementary Note 4). 
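
The quantification behind this comparison (Pol II intensity within the Xist territory normalized to control siRNA levels, summarized as a mean with standard error and compared by an unpaired two-sample t-test) can be sketched as follows. This is an illustration under stated assumptions: the per-cell intensities are hypothetical, only four cells per condition are used for brevity (the study reports 50), and the pooled-variance Student statistic is shown without a P value because the Python standard library has no t-distribution function.

```python
from math import sqrt
from statistics import mean, stdev


def normalize_to_control(values, control_values):
    """Divide each intensity by the mean intensity of the control cells."""
    reference = mean(control_values)
    return [v / reference for v in values]


def sem(values):
    """Standard error of the mean: sample standard deviation / sqrt(n)."""
    return stdev(values) / sqrt(len(values))


def pooled_t_statistic(a, b):
    """Student's unpaired two-sample t statistic with pooled variance."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * stdev(a) ** 2 + (nb - 1) * stdev(b) ** 2) / (na + nb - 2)
    return (mean(a) - mean(b)) / sqrt(pooled_var * (1 / na + 1 / nb))


# Hypothetical per-cell Pol II intensities within the Xist territory.
si_control = [1.0, 1.2, 0.8, 1.0]
si_sharp = [2.0, 2.2, 1.8, 2.0]

control_norm = normalize_to_control(si_control, si_control)
sharp_norm = normalize_to_control(si_sharp, si_control)
t_stat = pooled_t_statistic(sharp_norm, control_norm)
print(mean(sharp_norm), sem(sharp_norm), t_stat)
```

A large positive statistic here would correspond to more Pol II signal over the Xist territory in the knockdown than in the control, that is, a loss of exclusion.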

These results demonstrate that SHARP is required to exclude Pol II
from the inactive X chromosome and may be required for creating the
initial silenced compartment upon Xist localization. Although LBR is
not required for Pol II exclusion, it is likely to have an alternative role
during the initiation of Xist-mediated transcriptional silencing, such as
repositioning genes into this Pol II-excluded compartment.

Having identified SHARP as the direct Xist-interacting protein that
is required for excluding Pol II on the X chromosome, we sought to
determine how it might carry out this role. SHARP is a direct RNA
binding protein that was first identified in mammals on the basis of
its interaction with the SMRT co-repressor complex, which is known
to interact with HDAC3 and is required for activating its deacetylation
and transcriptional silencing activity in vivo. Based on these previous
observations, we hypothesized that Xist-mediated transcriptional 
silencing through SHARP would occur through SMRT and the silen- 
cing function of HDAC3. (We would not expect to identify these 
proteins by RAP-MS, which was designed to identify only direct 
RNA-protein interactions.) 

To test this hypothesis, we knocked down either SMRT or HDAC3 
and measured the expression of X chromosome genes upon Xist 
induction. Knockdown of SMRT or HDAC3 in both male and female 
ES cells abrogated silencing of X chromosome genes upon induction of 
Xist expression (Fig. 2b, Extended Data Figs 5, 7 and 8). To ensure that 
the observed silencing defect is specific for HDAC3 and not for other 
class I HDAC proteins, we knocked down HDAC1 or HDAC2 and 
observed no effect on gene silencing (Extended Data Fig. 5). To further 
confirm the specificity of our results, we used independent siRNAs to
knock down SMRT or HDAC3 and in all cases identified a similar
silencing defect (Extended Data Fig. 7). 

To determine whether this effect is similar to the effect produced by 
knockdown of SHARP or a distinct defect in transcriptional silencing, 
we tested whether HDAC3, the silencing protein in this complex, is
required for the exclusion of RNA Pol II from the Xist-coated territory. 
We found that knockdown of HDAC3 in both male and female ES cells 
eliminated the exclusion of RNA Pol II from the Xist-coated compart- 
ment to a similar degree to that seen for knockdown of SHARP (Fig. 3, 
Extended Data Fig. 9). 

These results suggest that SHARP silences transcription through 
SMRT and the HDAC3 silencing protein. This role for HDAC3 in 
Xist-mediated silencing would explain the long-standing observation 
of global hypoacetylation on the entire X chromosome as one of the 
very first events that occur upon initiation of XCI.

One of the features of XCI is the recruitment of PRC2 and its
associated H3K27me3 repressive chromatin modifications across the
X chromosome in an Xist-dependent manner. Although PRC2 is
not required for the initiation of XCI (Fig. 3b), it or its associated
H3K27me3 repressive chromatin modifications may be involved in
establishing an epigenetically silenced state. Yet, how Xist recruits the
PRC2 complex across the X chromosome is unknown. Since we failed
to identify any PRC2 components by RAP-MS, and various HDAC
complexes are known to recruit PRC2, we hypothesized that PRC2
recruitment is mediated by SHARP and HDAC3.

Figure 4 | SHARP is required for PRC2 recruitment across the Xist-coated
territory. a, Xist (red), EZH2 (green) and DAPI (blue) across siRNA
conditions (rows). b, c, Quantification of EZH2 levels within the defined Xist
territory normalized to the levels in the control siRNA sample for male ES cells
(b) and differentiating female ES cells (c). Error bars, standard error of the
mean across 50 cells from one experiment. NS, not significant; ***P
value < 0.005, ****P value < 0.001 relative to siControl by an unpaired
two-sample t-test. Scale bars, 5 µm. d, A model for Xist-mediated transcriptional
silencing and recruitment of PRC2 across the X chromosome.

To test this hypothesis, we looked at PRC2 recruitment to the Xist- 
coated territory. In wild-type cells, we observe a strong enrichment of 
EZH2, a component of PRC2, over the Xist-coated territory after 16 h
of induction (Fig. 4a). Upon knockdown of EED, a distinct component
of the PRC2 complex that is required for its proper localization to
chromatin, we observe no enrichment of EZH2 over the Xist-coated
territory at this same time point (Fig. 4a). Similarly, upon knockdown 
of SHARP, we identified a loss of EZH2 over the Xist-coated territory, of 
comparable magnitude to that observed in the absence of EED (Fig. 4a). 
Conversely, upon knockdown of LBR, we observed a strong enrichment 
of EZH2 over the Xist-coated territory, of comparable magnitude to the 
levels of recruitment in wild-type conditions (Fig. 4b). To determine 
whether HDAC3 is required for PRC2 recruitment, we knocked down
HDAC3 and observed a loss of PRC2 recruitment (Fig. 4a), of compar- 
able magnitude to that observed upon loss of SHARP (Fig. 4b). 
Knockdown of SHARP or HDAC3 led to the same PRC2-recruitment 
defect in female ES cells (Fig. 4c, Extended Data Fig. 10). 

These results indicate that Xist-mediated recruitment of PRC2 
across the X chromosome is dependent on SHARP and HDAC3. 
Whether this occurs through an interaction with SHARP or HDAC3
(direct recruitment) or due to the HDAC3-induced silenced transcription
state, chromatin modifications, or compact chromatin structure
(indirect recruitment) remains unclear (Supplementary Note 5). Yet,
our results are in contrast to a previous model that PRC2 is recruited
through a direct interaction between EZH2 and the A-repeat of Xist.
The evidence for this PRC2-Xist interaction is based on in vitro binding
and purifications in non-denaturing conditions. Recently, the specificity
of this interaction has been questioned because PRC2 appears
to bind promiscuously to many RNAs, including bacterial RNAs, in
these conditions. Instead, our results are consistent with reports
that deletion of the A-repeat, unlike knockdown of SHARP or
HDAC3, has no significant effect on PRC2 recruitment to the Xist-coated
territory (Fig. 4b).

Taken together, our data suggest a model for how Xist can orches- 
trate transcriptional silencing on the X chromosome (Fig. 4d). Upon 
initiation of Xist expression, Xist can localize to sites on the X chro- 
mosome by binding to the SAF-A protein”®, which is known to interact 
directly with chromatin”®. Xist interacts directly with SHARP to recruit 
SMRT* to these DNA sites across the inactive X chromosome. This 
Xist-SHARP-SMRT complex either recruits HDAC3 directly to the X 
chromosome or may act to induce the enzymatic activity of HDAC3’ 
that may already be present at active genes across the X chromosome”’. 
Through HDAC3, Xist can direct the removal of activating histone 
acetylation marks on chromatin, thereby compacting chromatin and 
silencing transcription’’. Upon initiating the silenced state, Xist 
recruits PRC2 across the X chromosome in an HDAC3-dependent 
manner, either through a direct interaction between PRC2 and 
HDAC3 or indirectly through HDAC3-induced transcriptional silen- 
cing or chromatin compaction (Supplementary Note 5). In this way, 
the same Xist-interacting protein might achieve two essential roles in 
XCI: initiating the inactive state by recruiting transcriptional silencers 
(HDAC3) and maintaining the inactive state by recruiting stable epi- 
genetic silencers (PRC2)”’. Beyond Xist, RAP-MS provides a critical 
tool that will accelerate the discovery of novel lncRNA mechanisms
that have thus far proved elusive. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


No statistical methods were used to predetermine sample size. Detailed RAP-MS 
protocols are available at the authors’ web site (http://www.lncrna.caltech.edu/RAP).

Mouse ES cell culture. All mouse ES cell lines were cultured in serum-free 2i/LIF 
medium as previously described’’. We used the following cell lines: wild-type male 
ES cells (V6.5 line); male ES cells expressing Xist from the endogenous locus under 
control of a tet-inducible promoter (pSM33 ES cell line) as previously described’’; 
male ES cells carrying a cDNA Xist transgene without the A-repeat integrated into 
the Hprt locus under control of the tet-inducible promoter (A-repeat deletion: 
provided by A. Wutz)*’; female ES cells (F1 2-1 line). This wild-type female mouse 
ES cell line is derived from a 129 × castaneous F1 mouse cross as previously
described’. 

Xist induction. For Dox inducible cells (pSM33 and A-repeat deletion), we 
induced Xist expression by treating cells with 2 µg ml⁻¹ doxycycline (Sigma) for
6h, 16h, or 24h based on the application. For female ES cells (F1 2-1 line), we 
induced Xist expression by inducing differentiation; 2i was replaced with MEF 
media (DMEM, 10% Gemini Benchmark FBS, 1× L-glutamine, 1× NEAA, 1×
penicillin/streptomycin; Life Technologies unless otherwise indicated) for 24h
followed by treatment with 1 µM retinoic acid (RA) (Sigma) for an additional 24h.

We measured the amount of Xist RNA in both the doxycycline-inducible cells 
(6h induction) and differentiating female ES cells (24h induction) by qRT-PCR. 
We normalized this level to various RNA housekeeping controls, 18S, 28S, and U6, 
in both cell populations and calculated the fold expression difference between male 
and female cells using the comparative Ct method. We observed a range of 
expression, with the male inducible system expressing from 5 to 20-fold (12-fold 
average) more Xist than the female cells. We note that this estimate likely repre- 
sents an upper limit of the actual differences because the female ES cell system is 
known to be heterogeneous in Xist-induction, such that not every cell will induce 
Xist to the same level after 24h of retinoic acid treatment. Accordingly, we expect 
that the actual differences between the male inducible system and differentiating 
female ES cells are actually significantly lower. While the precise levels are hard to 
compare by single molecule FISH, the size and intensity of each Xist RNA-coated 
territory is similar in both systems at the time points used. 
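
The comparative Ct calculation described above can be sketched as follows. This is a minimal illustration rather than the analysis code used in this study: the function names and the Ct values are hypothetical, and the sketch assumes roughly 100% amplification efficiency for both the Xist and the housekeeping amplicons.

```python
def delta_ct(ct_target: float, ct_reference: float) -> float:
    """Ct of the target RNA relative to a housekeeping control (e.g. 18S)."""
    return ct_target - ct_reference


def fold_difference(ct_xist_a: float, ct_ref_a: float,
                    ct_xist_b: float, ct_ref_b: float) -> float:
    """Fold expression of Xist in sample A relative to sample B (2^-ddCt).

    Assumes roughly 100% amplification efficiency for every amplicon.
    """
    dd_ct = delta_ct(ct_xist_a, ct_ref_a) - delta_ct(ct_xist_b, ct_ref_b)
    return 2.0 ** (-dd_ct)


# Hypothetical Ct values, for illustration only (not data from this study).
male_inducible = {"xist": 21.0, "ref": 10.0}
female_differentiating = {"xist": 24.0, "ref": 10.0}

print(fold_difference(male_inducible["xist"], male_inducible["ref"],
                      female_differentiating["xist"],
                      female_differentiating["ref"]))
```

Repeating the calculation with each housekeeping control (18S, 28S and U6) in turn would give the kind of range of fold differences reported here.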

The male-inducible system is more sensitive for identifying proteins that affect 
silencing compared to a female system because Xist-mediated silencing in males 
will lead to loss of 100% of X-chromosome transcripts rather than only 50% in a 
female system, which still retains one active X. 

UV crosslinking. Cells were washed once with PBS and then crosslinked on ice 
using 0.8 J cm⁻² (UV8k) of UV at 254 nm in a Spectrolinker UV Crosslinker. Cells
were then scraped from culture dishes, washed once with PBS, pelleted by cent- 
rifugation at 1,500g for 4 min, and flash-frozen in liquid nitrogen for storage at 
-80°C. 

SILAC ES cell culture. For SILAC experiments, we adapted our ES cell culture 
procedures to incorporate either light or heavy lysine and arginine amino acids. 
The 2i/LIF SILAC medium was composed as follows: custom DMEM/F-12 with- 
out lysine or arginine (Dundee Cell Products) was supplemented with either 
0.398 mM heavy Arg10 (Sigma) or unlabelled arginine (Sigma) and either
0.798 mM heavy Lys8 (Cambridge Isotope Labs) or unlabelled lysine (Sigma),
0.5× B-27 (Gibco), 2 mg ml⁻¹ bovine insulin (Sigma), 1.37 µg ml⁻¹ progesterone
(Sigma), 5 mg ml⁻¹ BSA Fraction V (Gibco), 0.1 mM 2-mercaptoethanol (Sigma),
5 ng ml⁻¹ murine LIF (GlobalStem), 0.1 µM PD0325901 (SelleckChem) and
0.3 µM CHIR99021 (SelleckChem). Cells in both heavy and light 2i/LIF SILAC
medium were also supplemented with 0.2 mg ml⁻¹ of unlabelled proline (Sigma)
to prevent conversion of labelled arginine to proline. 2i inhibitors were added fresh 
with each medium change. 

Adapting cells to SILAC conditions. Prior to mass spectrometry, ES cells were 
adapted to SILAC conditions over three passages. The heavy or light culture 
medium was replaced every 24-48h depending on cell density, and cells were 
passaged every 72h using 0.025% trypsin (Gibco), rinsing dissociated cells from 
the plates with DMEM/F12 containing 0.038% BSA Fraction V (Gibco). Cells were 
grown in two different types of medium: 2i/LIF SILAC medium with light (unla- 
belled) lysine and arginine, or 2i/LIF SILAC medium with heavy isotope-labelled 
lysine and arginine. 

Measuring SILAC incorporation. To examine the efficiency of SILAC labelling 
in pSM33 cells, we tested for the incorporation of labelled amino acids after 10 days 
of growth (3 cell passages) in heavy 2i/LIF SILAC medium. Pellets of 2 million cells 
were boiled for 10min in LDS Sample Loading Buffer (Invitrogen) and then 
proteins were separated by SDS-PAGE on a 4-12% Tris-Glycine polyacrylamide 
gel (Invitrogen). Total protein was stained with Colloidal Coomassie (Invitrogen) 
and gel slices were excised with a clean scalpel and transferred to microcentrifuge 
tubes for in-gel tryptic digest. Protein disulphide bonds were reduced with DTT 
then alkylated with iodoacetamide. Proteins were digested with trypsin overnight
and then extracted using successive washes with 1% formic acid/2% acetonitrile, 1:1 
acetonitrile/water, and 1% formic acid in acetonitrile. Peptides were collected, 
lyophilized, then resuspended in 1% formic acid for mass spectrometry analysis 
(described later in Mass spectrum measurements). Peptides were identified from 
mass spectra using MaxQuant (described later in Mass spectrometry data analysis). 
The incorporation rate of labelled amino acids was calculated based on the ratio of 
the intensity of heavy and light versions of each peptide identified. In cells used for 
subsequent assays, we confirmed that over 95% of peptides from cellular proteins 
showed >95% incorporation of labelled amino acids (Extended Data Fig. 1b). 
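
The incorporation check can be illustrated with a short sketch. It is not the MaxQuant calculation itself: it assumes that incorporation for a peptide is taken as heavy / (heavy + light) intensity, and the function names and intensity values are hypothetical.

```python
def incorporation(heavy_intensity: float, light_intensity: float) -> float:
    """Fraction of a peptide's signal carried by the heavy-labelled form."""
    total = heavy_intensity + light_intensity
    if total <= 0:
        raise ValueError("peptide has no measured intensity")
    return heavy_intensity / total


def fraction_well_labelled(peptides, threshold: float = 0.95) -> float:
    """Share of peptides whose incorporation exceeds the threshold."""
    rates = [incorporation(heavy, light) for heavy, light in peptides]
    return sum(rate > threshold for rate in rates) / len(rates)


# Hypothetical (heavy, light) intensity pairs, for illustration only.
example_peptides = [(99.0, 1.0), (98.0, 2.0), (97.0, 3.0), (80.0, 20.0)]
print(fraction_well_labelled(example_peptides))
```

Under the criterion stated above, a labelled culture would be accepted when this fraction exceeds 0.95.
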
RNA antisense purification-mass spectrometry (RAP-MS). Probe design and 
generation. To create the probes used to capture target RNAs, we designed and 
synthesized 90-mer DNA oligonucleotides (Eurofins Operon) that spanned the 
entire length of the target RNA. The sequence of each DNA oligonucleotide probe 
was antisense to the complementary target RNA sequence. Each DNA oligonucleo- 
tide probe was also modified with a 5’ biotin in order to enable capture of DNA- 
RNA hybrids on streptavidin coated magnetic beads (described below). While we 
had previously used 120-mer probes, we found that 90-mer probes provided com- 
parable stringency and yield in the conditions used. For Xist, we used 142 probes 
that covered the entire mature RNA sequence, with the exception of regions that 
match to other transcripts or genomic regions as previously described'***. 

Total cell lysate preparation. For the 18S and U1 experiments we used total cellular 
lysates prepared in the following manner. We lysed batches of 20 million cells 
by completely resuspending frozen cell pellets in ice cold detergent-based Cell 
Lysis Buffer (10 mM Tris pH 7.5, 500 mM LiCl, 0.5% dodecyl maltoside (DDM, 
Sigma), 0.2% sodium dodecyl sulphate (SDS, Ambion), 0.1% sodium deoxycholate 
(Sigma)). Next, 1× Protease Inhibitor Cocktail (Set III, EDTA-free, Calbiochem)
and 920 U of Murine RNase Inhibitor (New England Biolabs) were added and the 
sample was incubated for 10 min on ice to allow lysis to proceed. During this 
incubation period, the cell sample was passed 3-5 times through a 26-gauge needle 
attached to a 1 ml syringe in order to disrupt the pellet and shear genomic DNA. 
Each sample was then sonicated using a Branson Digital Sonifier with a microtip 
set at 5 W power for a total of 30 s in intermittent pulses (0.7 s on, 1.3 s off). During 
sonication the samples were chilled to prevent overheating of the lysate. The 
samples were then treated for 10 min at 37 °C with 2.5 mM MgCl₂, 0.5 mM
CaCl₂, and 20 U of TURBO DNase (Ambion) to digest DNA. Samples were
returned to ice and the reaction was immediately terminated by the addition of 
10mM EDTA and 5 mM EGTA. Disulphide bonds were reduced by addition of 
2.5 mM Tris-(2-carboxyethyl) phosphine (TCEP) and samples were then mixed 
with twice the lysate volume of 1.5× LiCl/Urea Buffer (the final 1× buffer contains
10 mM Tris pH 7.5, 5 mM EDTA, 500 mM LiCl, 0.5% DDM, 0.2% SDS, 0.1%
deoxycholate, 4 M urea, 2.5 mM TCEP). Lysates were incubated on ice for 10 min 
then cleared by centrifugation in an Eppendorf 5424R centrifuge for 10 min at 
16,000g. Supernatants were pooled and flash frozen in liquid nitrogen for storage 
at −80 °C.

Nuclear lysate preparation. For the Xist versus U1 and 45S versus U1 comparisons, 
we used nuclear lysates prepared in the following manner. We lysed batches of 50 
million cells by resuspending frozen pellets in 1 ml Lysis Buffer 1 (10 mM HEPES 
pH 7.2, 20 mM KCl, 1.5 mM MgCl₂, 0.5 mM EDTA, 1 mM Tris(2-carboxyethyl)phosphine
(TCEP), 0.5 mM PMSF). Then the samples were centrifuged at 3,300g
for 10 min to pellet cells. The cell pellets were resuspended in 1 ml Lysis Buffer 1 
with 0.1% dodecyl maltoside (DDM) and Dounced 20 times using a glass Dounce 
homogenizer with the small clearance pestle (Kontes). Nuclei released from the 
cells after Douncing were pelleted by centrifugation at 3,300g then resuspended in 
550 µl Lysis Buffer 2 (20 mM Tris pH 7.5, 50 mM KCl, 1.5 mM MgCl₂, 2 mM
TCEP, 0.5 mM PMSF, 0.4% sodium deoxycholate, 1% DDM, and 0.1%
N-lauroylsarcosine (NLS)). Samples were incubated on ice for 10 min, then each 
sample was sonicated using a Branson Sonifier at 5 W power for a total of 1 min in 
intermittent pulses (0.7s on, 3.3s off) to lyse nuclei and solubilize chromatin. 
During sonication the samples were chilled to prevent overheating of the nuclear 
lysate. Samples were then treated with 2.5 mM MgCl₂, 0.5 mM CaCl₂, and
330 U TURBO DNase (Ambion) for 12 min at 37 °C to further solubilize chro- 
matin. After DNase treatment, lysates were mixed with equal volume of 
2× Hybridization Buffer (the final 1× buffer contains 10 mM Tris pH 7.5,
5 mM EDTA, 500 mM LiCl, 0.5% DDM, 0.2% SDS, 0.1% deoxycholate, 4 M urea,
2.5 mM TCEP). Finally, lysates were cleared by centrifugation for 10 min at
16,000g in an Eppendorf 5424R centrifuge and the resulting supernatants were 
pooled and flash frozen in liquid nitrogen for storage at −80 °C.

RNA antisense purification of crosslinked complexes. Lysates from 200 million or 
800 million cells were used for each capture. For 200 million cells the following 
protocol was used, and scaled appropriately for larger cell numbers. For each 
capture, a sample of heavy or light SILAC labelled frozen lysate was warmed to 
37 °C. For each sample, 1.2 ml of Streptavidin Dynabeads MyOne C1 magnetic 
beads (Invitrogen) were washed 6 times with equal volume of hybridization buffer
(10 mM Tris pH 7.5, 5 mM EDTA, 500 mM LiCl, 0.5% DDM, 0.2% SDS, 0.1%
deoxycholate, 4 M urea, 2.5 mM TCEP). Lysate samples were pre-cleared by
incubation with the washed Streptavidin C1 magnetic beads at 37 °C for 30 min
with intermittent shaking at 1,100 r.p.m. on an Eppendorf Thermomixer C (30 s
mixing, 30s off). Streptavidin beads were then magnetically separated from lysate 
samples using a Dynamag magnet (Life Technologies). The beads used for pre- 
clearing lysate were discarded and the lysate sample was transferred to fresh tubes 
twice to remove all traces of magnetic beads. Biotinylated 90-mer DNA oligonucleotide
probes specific for the RNA target of interest (20 µg per sample, in water)
were heat-denatured at 85 °C for 3 min and then snap-cooled on ice. Probes and 
pre-cleared lysate were mixed and incubated at 67 °C using an Eppendorf thermo- 
mixer with intermittent shaking (30s shaking, 30 s off) for 2 h to hybridize probes 
to the capture target RNA. Hybrids of biotinylated DNA probes and target RNA 
were then bound to streptavidin beads by incubating each sample with 1.2 ml of 
washed Streptavidin coated magnetic beads at 67 °C for 30 min on an Eppendorf 
Thermomixer C with intermittent shaking as above. Beads with captured hybrids 
were washed 6 times with LiCl/Urea Hybridization Buffer at 67 °C for 5 min to 
remove non-specifically associated proteins. Between 0.5 and 1% of the total beads 
were removed and transferred to a fresh tube after the final wash to examine RNA 
captures by qPCR (see Elution and analysis of RNA samples). The remaining 
beads were resuspended in Benzonase Elution Buffer (20mM Tris pH 8.0, 
2 mM MgCl₂, 0.05% NLS, 0.5 mM TCEP) for subsequent processing of the protein
samples. 

Elution of protein samples. Elution of captured proteins from streptavidin beads 
was achieved by digesting all nucleic acids (both RNA and DNA, double-stranded 
and single-stranded) using 125 U of Benzonase nonspecific RNA/DNA nuclease 
for 2h at 37°C (Millipore, #71206-3). Beads were then magnetically separated 
from the sample using a DynaMag magnet (Life Technologies) and the supernatant
containing eluted Xist-specific proteins was precipitated overnight at 4 °C
with 10% trichloroacetic acid (TCA). TCA treated protein elution samples were 
pelleted by centrifugation for 30 min at >20,000g, then washed with 1 ml cold 
acetone and recentrifuged. Final protein elution pellets were air dried to remove 
acetone and stored at −20 °C until processing for mass spectrometry.

Elution and analysis of RNA samples. Beads with hybrids were magnetically sepa- 
rated using a 96-well DynaMag (Life Technologies) and the supernatant was 
discarded. Beads were then resuspended by pipetting in 20 µl NLS RNA Elution
Buffer (20 mM Tris pH 8.0, 10 mM EDTA, 2% NLS, 2.5 mM TCEP). To release the 
target RNA, beads were heated for 2 min at 95 °C in an Eppendorf Thermomixer 
C. Beads were then magnetically separated using a 96-well DynaMag (Life 
Technologies) and the supernatants containing eluted target RNA were digested 
by the addition of 1 mg ml⁻¹ Proteinase K for 1h at 55 °C to remove all proteins.
The remaining nucleic acids were then purified by ethanol precipitation onto 
SILANE beads (Invitrogen) as previously described’***. DNA probes were 
removed by digestion with TURBO DNase (Ambion). To quantify RNA yield 
and enrichment, qPCR was performed as previously described’’. 

Mass spectrometry analysis. Preparation of proteins for mass spectrometry. 
Proteins from RAP-MS captures were resuspended in fresh 8 M urea dissolved 
in 40 µl of 100 mM Tris-HCl pH 8.5. Disulphide bonds were reduced by incubation
with 3 mM TCEP for 20 min at room temperature, followed by alkylation with
11 mM iodoacetamide for 15 min at room temperature in the dark. Samples were 
then digested with 0.1 µg endoproteinase Lys-C for 4h at room temperature. After
Lys-C digestion the samples were diluted to a final concentration of 2 M urea by 
the addition of 100 mM Tris-HCl pH 8.5, and CaCl₂ was added to a final concentration
of 1 mM. Tryptic peptides were generated by treatment with 0.1 to 0.5 µg of
trypsin overnight at room temperature. Contaminating detergents were removed 
from peptides using HiPPR detergent removal columns (Thermo), and peptides 
were protonated by the addition of 5% formic acid before desalting on a Microm 
Bioresources C8 peptide MicroTrap column. Peptide fractions were collected and 
lyophilized, and dried peptides were resuspended in 0.2% formic acid with 5% 
acetonitrile. 

Mass spectrum measurements. Liquid chromatography-mass spectrometry and 
data analyses of the digested samples were carried out as previously described** 
with the following modifications. All experiments were performed on a nanoflow 
LC system, EASY-nLC 1000 coupled to a hybrid linear ion trap Orbitrap Elite mass 
spectrometer (Thermo Fisher Scientific, Bremen, Germany) equipped with a 
nanoelectrospray ion source (Thermo Fisher Scientific). For the EASY-nLC II 
system, solvent A consisted of 97.8% H₂O, 2% acetonitrile, and 0.2% formic acid
and solvent B consisted of 19.8% H₂O, 80% acetonitrile, and 0.2% formic acid. For
the LC-MS/MS experiments, 200 ng of digested peptides were directly loaded at a
flow rate of 500 nl min⁻¹ onto a 16-cm analytical HPLC column (75 µm ID)
packed in-house with ReproSil-Pur C18AQ 3 µm resin (120 Å pore size, Dr.
Maisch, Ammerbuch, Germany). The column was enclosed in a column heater
operating at 30 °C. After 30 min of loading time, the peptides were separated with a
75 min gradient at a flow rate of 350 nl min⁻¹. The gradient was as follows: 0-2%
Solvent B (5 min), 2-30% B (60 min), and 100% B (10 min). The Elite was operated 
in data-dependent acquisition mode to automatically alternate between a full scan 
(m/z = 400-1,600) in the Orbitrap and subsequent rapid 20 collision-induced 
dissociation (CID) MS/MS scans in the linear ion trap. CID was performed with 
helium as collision gas at a normalized collision energy of 35% and 10 ms of 
activation time. 

MS data analysis. Thermo RAW files were searched with MaxQuant (v 
1.5.0.30)°**”. Spectra were searched against all UniProt mouse entries (43,565 
entries, downloaded 02 Oct 14) and MaxQuant contaminant database (245 ent- 
ries). Decoy sequences (reversed peptide sequences) were generated in MaxQuant 
to estimate the false discovery rate**. Search parameters included multiplicity of 2 
with heavy Arg (+10.0083) and heavy Lys (+8.0142) as heavy peptide modifications.
Variable modifications included oxidation of Met (+15.9949) and protein
N-terminal acetylation (+42.0106). Carboxyamidomethylation of Cys 
(+57.0215) was specified as a fixed modification. Protein and peptide false dis- 
covery rates were thresholded at 1%. Precursor mass tolerance was 7 p.p.m. (or less 
for individual peptides). Fragment mass tolerance was 0.5 Da. Requantify and 
match between runs were both enabled. Trypsin was specified as the digestion 
enzyme with up to 2 missed cleavages. 
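
To make the search tolerances concrete, the sketch below shows how a parts-per-million tolerance translates into an absolute mass window and how the SILAC label shifts listed above add to a peptide mass. It is an illustration only and does not reproduce MaxQuant internals; the function names and the example masses are hypothetical.

```python
# Label mass shifts in Da, as listed in the search parameters above.
HEAVY_SHIFTS = {"Arg10": 10.0083, "Lys8": 8.0142}


def ppm_window(mass: float, tolerance_ppm: float = 7.0):
    """Return the (low, high) bounds around `mass` for a p.p.m. tolerance."""
    delta = mass * tolerance_ppm / 1e6
    return mass - delta, mass + delta


def within_tolerance(observed: float, theoretical: float,
                     tolerance_ppm: float = 7.0) -> bool:
    """True if the observed mass lies inside the tolerance window."""
    low, high = ppm_window(theoretical, tolerance_ppm)
    return low <= observed <= high


def heavy_mass(light_mass: float, n_arg: int, n_lys: int) -> float:
    """Mass of the heavy form of a peptide with n_arg Arg and n_lys Lys."""
    return (light_mass
            + n_arg * HEAVY_SHIFTS["Arg10"]
            + n_lys * HEAVY_SHIFTS["Lys8"])


# Hypothetical peptide mass of 1,000 Da: 7 p.p.m. is a window of +/-0.007 Da.
print(ppm_window(1000.0))
print(within_tolerance(1000.005, 1000.0))
```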

Identification of RNA-interacting proteins. Proteins of interest from RAP-MS 
captures were identified based on several criteria. First, proteins were considered 
identified only if 2 or more unique peptides were found in the mass spectrum. 
Then proteins of interest were selected based on the SILAC ratio of capture versus 
control samples. SILAC ratios for each peptide were calculated based on the 
intensity ratios of heavy and light SILAC pairs. The protein ratio is the median 
of all calculated peptide ratios, with a minimum of two SILAC pairs required for a 
SILAC ratio to be assigned to a given protein. A SILAC ratio of ≥3.0 (fold
enrichment over control sample) was used as a cutoff for further analysis. We
excluded known contaminants, including human keratin and proteins introduced 
during the sample purification and preparation process (such as streptavidin, 
benzonase, and trypsin), as well as naturally biotinylated proteins that contam- 
inate the preparation by binding to streptavidin beads. 
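
The selection criteria above can be summarized in a short sketch. This is a hedged illustration, not the pipeline used in this study: the data layout, the function names and the protein names are hypothetical, and only the thresholds (two unique peptides, two SILAC pairs, a ratio of at least 3.0) come from the text.

```python
from statistics import median

RATIO_CUTOFF = 3.0        # fold enrichment over control (from the text)
MIN_UNIQUE_PEPTIDES = 2   # peptides needed to call a protein identified
MIN_SILAC_PAIRS = 2       # heavy/light pairs needed to assign a ratio


def protein_ratio(peptide_ratios):
    """Median of peptide SILAC ratios, or None if too few pairs."""
    if len(peptide_ratios) < MIN_SILAC_PAIRS:
        return None
    return median(peptide_ratios)


def select_proteins(proteins, contaminants=frozenset()):
    """Return names of proteins that pass every criterion.

    `proteins` maps a name to (n_unique_peptides, [peptide SILAC ratios]).
    """
    selected = []
    for name, (n_unique, ratios) in proteins.items():
        if name in contaminants or n_unique < MIN_UNIQUE_PEPTIDES:
            continue
        ratio = protein_ratio(ratios)
        if ratio is not None and ratio >= RATIO_CUTOFF:
            selected.append(name)
    return selected


# Hypothetical input, for illustration only.
example = {
    "protein_A": (4, [3.5, 4.0, 5.0]),     # enriched, passes
    "protein_B": (3, [1.1, 0.9, 1.0]),     # not enriched
    "protein_C": (1, [6.0]),               # too few peptides
    "streptavidin": (5, [8.0, 9.0]),       # known contaminant
}
print(select_proteins(example, contaminants={"streptavidin"}))
```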

RAP-MS experiments and controls. 18S rRNA versus U1 snRNA. To validate the 
RAP-MS method and identify proteins specifically interacting with 18S ribosomal 
RNA or U1 snRNA, we performed captures of each target RNA in parallel samples 
from heavy and light labelled lysates from wild-type V6.5 ES cells. The total 
protein quantity in elution samples from each RAP-MS capture was measured 
by comparing the median intensity of peptides identified in a single quantitation 
MS run for each sample. The heavy and light label swapped samples were then 
mixed equally based on total protein quantity and analysed by mass spectrometry 
to identify the SILAC enrichment ratio of proteins originating from 18S ribosomal 
RNA or U1 snRNA captures. The experiment was performed twice and each 
experimental set contained two biological replicates of 18S and U1 captures (heavy 
and light labelling states). 

Xist lncRNA versus U1 snRNA captures. To identify proteins specifically interacting
with Xist lncRNA, we performed captures as described above with either 200
million or 800 million pSM33 cells treated with doxycycline for 6h. The total 
protein quantity in elution samples from each RAP-MS capture was measured 
by a single quantitation MS run for each sample. Heavy and light label swapped 
samples were mixed equally based on total protein quantity, and analysed by mass 
spectrometry. SILAC ratios of Xist-enriched proteins versus U1-enriched proteins 
were calculated and used to identify Xist-specific interacting proteins for further 
analysis. The experiment was performed twice and each experimental set con- 
tained two biological replicates of Xist and U1 captures, from heavy and light 
labelled samples. Proteins replicated well between samples, with a sole exception 
(LBR) that was missed only because its enrichment level (twofold) fell below our 
enrichment cutoff (threefold) in some replicate samples. 

Xist lncRNA capture from non-crosslinked cells. As a control to ensure that purified
proteins are not non-specifically associated or binding in vitro with target RNAs
during capture, we performed RAP-MS captures of Xist from non-crosslinked
cells otherwise treated in the same manner (that is, doxycycline treated for 6h).

Xist lncRNA capture from cells where Xist is not expressed. To confirm that the
identified proteins are not resulting from background proteins or probe asso- 
ciation with other RNAs or proteins in the pSM33 cells, we performed RAP 
captures of Xist from pSM33 cells that were not treated with doxycycline, but 
which were otherwise treated identically. 

45S pre-rRNA capture versus U1 capture. To ensure that the proteins enriched in 
Xist captures using RAP-MS are not simply due to increased protein capture as a 
consequence of long target RNA transcripts, we additionally performed captures 
of the 13,000 nucleotide long 45S pre-ribosomal RNA as a control. To ensure 
specific capture only of the 45S, and not the mature 18S and 28S, we designed 
probes that specifically targeted the internal transcribed spacer regions (ITS1 and
ITS2) that are only present in the 45S pre-ribosomal RNA. The experiment was
performed in the same manner and with the same conditions as the Xist lncRNA
captures described above. To compare Xist protein enrichment to 45S protein 
enrichment, we used a SILAC approach based on direct comparison of two sam- 
ples that share a common denominator (called spike-in SILAC”’). Specifically, we 
calculated an overall Xist/45S SILAC ratio by multiplying the Xist/U1 ratio by the 
U1/45S ratio for each identified protein. 
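
The common-denominator calculation can be written out as a small sketch. The function name, the dictionary layout and the ratio values are hypothetical; the sketch also assumes that a combined ratio is reported only for proteins quantified in both comparisons.

```python
def combine_ratios(xist_vs_u1, u1_vs_45s):
    """Xist/45S ratio per protein = (Xist/U1) * (U1/45S).

    Only proteins quantified in both comparisons receive a combined ratio.
    """
    shared = xist_vs_u1.keys() & u1_vs_45s.keys()
    return {name: xist_vs_u1[name] * u1_vs_45s[name] for name in sorted(shared)}


# Hypothetical SILAC ratios, for illustration only.
xist_vs_u1 = {"protein_A": 6.0, "protein_B": 2.0}
u1_vs_45s = {"protein_A": 0.5, "protein_C": 1.0}
print(combine_ratios(xist_vs_u1, u1_vs_45s))
```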

Protein domain classification. We defined the conserved domain structures of 
proteins using the Protein Families database (Pfam*). 

RNA immunoprecipitation in UV-crosslinked cells. We crosslinked pSM33 
cells after 6h of doxycycline treatment with 0.4 J cm⁻² of UV at 254 nm. Cells were
lysed and RNA was digested with RNase I to achieve a size range of 100-500 
nucleotides in length. Lysate preparations were precleared by mixing with Protein 
G beads for 1h at 4 °C. For each sample, target proteins were immunoprecipitated 
from 20 million cells with 10 µg of antibody (Supplementary Table 1) and 60 µl of
Protein G magnetic beads (Invitrogen). The antibodies were pre-coupled to the 
beads for 1h at room temperature with mixing before incubating the precleared 
lysate with the antibody-bead complexes for 2 h at 4 °C. After the immunoprecipitation,
the beads were rinsed with a wash buffer of 1× PBS with detergents. After a
dephosphorylation treatment, the RNA in each sample was ligated to a mixture of 
barcoded adapters in which each adaptor had a unique barcode identifier. After 
ligation, beads were rinsed with 1× PBS and detergents and then 5× PBS
(750 mM NaCl) and detergents before pooling 3-4 antibodies in a new tube.
The proteins and RNA were then eluted from the Protein G beads with 6 M urea 
and 40 mM DTT at 60 °C. Protein-RNA complexes were separated away from free 
RNA by covalently coupling proteins to NHS-magnetic beads (Pierce) and wash- 
ing 3 times in 6 M GuSCN (Qiagen RLT buffer) and heating in 1% NLS at 98 °C for 
10 min. The proteins were then digested with Proteinase K and RNA was purified 
for subsequent analysis. From the barcoded RNA in each pool, we generated 
Illumina sequencing libraries as previously described**. We saved a small percent- 
age (~1%) of starting material before immunoprecipitation and processed and 
sequenced this sample in parallel. 

Analysis of crosslinked RNA immunoprecipitation data. We computed the 
enrichment for any RNA upon immunoprecipitation with a specific protein rela- 
tive to its total levels in the cell. To do this, we counted the total number of reads 
overlapping the RNA in either the immunoprecipitation (IP) sample or the input 
control. To account for differences in read coverage between samples, each of these 
numbers was normalized to the total number of reads within the same experiment. 
This generates a normalized score, per RNA, within each sample. We then com- 
puted an enrichment metric by taking the ratio of these normalized values (IP/ 
input). We then compared these enrichment levels across different proteins and 
controls (that is, IgG). To enable direct comparison across proteins for a given 
gene, we needed to account for differences in the protein-specific background level,
which may arise from differences in IP efficiency or non-specific binding of each
antibody. To do this, we computed a normalized enrichment ratio by dividing the 
ratio for each gene by the average ratio across all genes for a given protein, as 
previously described’. 
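
The enrichment calculation described above can be sketched as follows. This is a simplified illustration, not the analysis code used in this study: the function name and read counts are hypothetical, and the sketch assumes that the count dictionaries cover all reads in each experiment and that RNAs without input coverage are skipped.

```python
def normalized_enrichment(ip_counts, input_counts):
    """Per-RNA IP/input enrichment, scaled by the mean enrichment across RNAs.

    Both arguments map an RNA name to its read count in one experiment.
    """
    ip_total = sum(ip_counts.values())
    input_total = sum(input_counts.values())
    enrichment = {}
    for rna, ip_reads in ip_counts.items():
        input_reads = input_counts.get(rna, 0)
        if input_reads == 0:
            continue  # enrichment is undefined without input coverage
        ip_norm = ip_reads / ip_total            # normalize to library size
        input_norm = input_reads / input_total
        enrichment[rna] = ip_norm / input_norm   # IP/input ratio
    # Divide by the average ratio across all RNAs for this protein to
    # account for protein-specific background.
    mean_enrichment = sum(enrichment.values()) / len(enrichment)
    return {rna: value / mean_enrichment for rna, value in enrichment.items()}


# Hypothetical read counts, for illustration only.
ip = {"rna_A": 300, "rna_B": 100, "rna_C": 100}
inp = {"rna_A": 100, "rna_B": 100, "rna_C": 100}
print(normalized_enrichment(ip, inp))
```

After this scaling, a value near 1 indicates background-level recovery for that antibody, so values can be compared across proteins and against the IgG control.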

To exclude the possibility of promiscuous binding to all RNAs, we considered 
various mRNA controls, which are not expected to bind to these proteins, includ- 
ing Oct4, Nanog, Stat3, and Suz12. These mRNAs were selected as examples 
because they are expressed in ES cells, although many mRNAs show similar 
results. To account for the possibility that the Xist RNA non-specifically binds 
to any RBP, we evaluated Xist with other RBPs that we did not identify as inter- 
acting with Xist by RAP-MS (PUM1 and HNRNPH). To ensure that a negative 
result (that is, no enrichment for Xist) is meaningful and does not reflect a failed 
immunoprecipitation experiment, we evaluated Neat1-1, which we previously
found immunoprecipitates with hnRNPH’. To further evaluate the level of enrichment
on other lncRNAs, we considered several lncRNAs including Malat1, Firre,
and Tug1. These lncRNAs were selected as examples because they are well-known
and expressed in ES cells, although many ES lncRNAs show similar results.

Immunoprecipitation and RT-qPCR. Female ES cells were differentiated then
crosslinked with UV4k as described above. Pellets of 20 million cells were lysed 
and treated with TURBO DNase (Ambion) to destroy DNA by incubation for 
10 min at 37 °C in an Eppendorf Thermomixer C. The lysate was pre-cleared by 
incubation with 180 µl of Dynabeads Protein G magnetic beads (Life
Technologies). Meanwhile, 10 µg of antibody for immunoprecipitation (SHARP
antibody, Novus NBP1-82952 or IgG antibody, Cell Signaling 2729S) was coupled
to 60 µl Protein G magnetic beads. After pre-clearing was completed, the lysate
was then mixed with the appropriate antibody-coupled Protein G magnetic beads 
and incubated for 2 h at 4°C on a Hulamixer sample mixer (Life Technologies) for 
protein capture. After immunoprecipitation, beads were washed with a wash 
buffer of 1× PBS with detergents and then captured nucleic acids were eluted
by digesting all proteins with 5.6 U proteinase K (New England Biolabs). Eluted
RNA was purified using the RNA Clean and Concentrator-5 Kit (Zymo Research)
and RT-qPCR was performed as described previously’* to evaluate RNA enrich- 
ment. 

V5-epitope tagged protein expression. For V5-tagged protein expression and 
immunoprecipitation, mouse ES cells were electroporated using the Neon trans- 
fection system (Invitrogen) with an episomally-replicating vector (pCAG-GW- 
V5-Hygro) encoding expression of a C-terminal V5 tagged ORF driven by a CAG 
promoter. ORFs were obtained from the DNASU plasmid repository as Gateway 
entry clones and inserted into pCAG-GW-V5-Hygro using an LR recombination 
reaction (Invitrogen). Transfected cells were selected on 125 µg ml⁻¹ Hygromycin
B (Invitrogen) to generate stably expressing lines. 

siRNA transfections. For siRNA knockdown experiments, 20 nM siRNAs were 
transfected using the Neon transfection system (settings: 1,200 V, 40 ms width, 1 
pulse). For each transfection, two 10 µl transfections with the same siRNA were
carried out in succession using 100,000 cells each, mixed, and plated equally
between two poly-L-lysine or poly-D-lysine (Sigma) and 0.2% gelatin (Sigma)-coated
no. 1.5 coverslips placed into wells of a 24-well plate containing 2i media.
After 48h, 2i media was replaced and cells on one coverslip of each pair were
treated with 2 µg ml⁻¹ doxycycline (Sigma) for 16h to induce Xist expression.
Coverslips were then fixed in Histochoice (Sigma) for 5 min, washed thoroughly in 
PBS, and dehydrated in ethanol for storage until FISH staining. 

For all proteins we used siRNA pools from Dharmacon (ON-TARGETplus 
SMARTpool siRNAs). For each of these, we tested whether the siRNA successfully 
reduced the targeted mRNA expression by >70%. For SAF-A and SMRT, the 
siRNAs failed to achieve this level of mRNA reduction, so we purchased additional 
siRNAs (and their associated controls) for SAF-A and SMRT from Qiagen and 
Ambion respectively, and selected siRNAs that successfully reduced on-target 
mRNA levels. siRNA against GFP was purchased from Qiagen. For additional 
independent siRNAs, the siRNAs were purchased as a pool from Dharmacon, 
Qiagen, and Ambion, or as each individual siRNA deconvoluted from the pool 
from Dharmacon and Qiagen (Supplementary Table 2). 

In addition to the proteins identified by RAP-MS, we knocked down several 
proteins previously reported to associate with Xist, including EED (a component 
of PRC2)*, YY1*!, SATB1”, SRSF1“, HNRNPC”, and ATRX”’. 

siRNA experiments in female ES cells. Female ES F1 2-1 cells were similarly 
transfected. To initiate differentiation and Xist expression for these cells, 2i was 
replaced with MEF media (DMEM, 10% Gemini Benchmark FBS, 1x 
L-glutamine, 1X NEAA, 1X penicillin/streptomycin; Life Technologies unless 
otherwise indicated) at 12 h post-transfection. Forty-eight hours after transfection, 
1 µM retinoic acid (Sigma) was administered for 24 h and cells were fixed as 
described above. For cells not undergoing differentiation, 2i was replaced 12 h 
and 48 h after transfection. 

Single-molecule RNA FISH. Single-molecule RNA fluorescence in situ hybrid- 
ization (FISH) experiments were done using QuantiGene ViewRNA ISH Cell 
Assay (Affymetrix) and QuantiGene ViewRNA ISH Cell 740 Module 
(Affymetrix) according to manufacturer’s protocol. Cells fixed on coverslips were 
first permeabilized with Detergent Solution QC at room temperature for 5 min, 
and then incubated with desired mixture of probe set (Affymetrix) in Probe Set 
Diluent QF at 40 °C for 3 h, followed by incubation with PreAmplifier Mix at 40 °C 
for 30 min, Amplifier Mix at 40 °C for 30 min, and Label Probe Mix at 40 °C for 
30 min sequentially. For DAPI staining, coverslips were incubated in 30 nM DAPI 
in PBS at room temperature for 15-20 min. Probe set and conjugated fluorophore 
for FISH were TYPE 1-XIST (550 nm), TYPE 4-GPC4, RBMX, SMC1A, MECP2 
(488 nm), TYPE 10-ATRX (740 nm), and TYPE 6-EED1, SHARP, LBR, SAFA, 
RBM15, MYEF2, PTBP1, HNRNPC, HNRNPM, CELF1, RALY, HDAC3, 
NCOR2, MID1, PIR (650 nm). 

Immunofluorescence and RNA FISH. For immunofluorescence (IF), cells were 
fixed on coverslips and permeabilized with 0.1% Triton-X in PBS at room tem- 
perature for 10 min, and blocked with 5% normal goat serum in PBS at room 
temperature for 10 min. Cells were then incubated with primary antibodies at 
room temperature for 1h, followed by incubating with secondary antibodies at 
room temperature for 1 h. The samples were then processed using the RNA FISH 
protocol, as described above. Primary antibodies and the dilution used for IF were 
anti-RNA polymerase II CTD repeat YSPTSPS (phospho S2) (Abcam; ab5095) 
(1:100), anti-NANOG (Abcam; ab80892) (1:100), and anti-EZH2 (Active Motif; 
39933) (1:100). Secondary antibodies and the dilution used for IF were Alexa Fluor 
405 goat anti-rabbit IgG (H+L) (Life Technologies; 1575534) (1:100) and Alexa 
Fluor 488 F(ab′)₂ fragment of goat anti-rabbit IgG (H+L) (Life Technologies; 
1618692) (1:100). 

Microscopic imaging. FISH and IF/FISH samples were imaged using a Leica DMI 
6000 Deconvolution Microscope with the Leica HC PL APO ×63/1.30 GLYC 
CORR CS2 objective. Samples stained with TYPE 10-ATRX (740 nm) were 
imaged using a Nikon Ti Eclipse with the Nikon CFI Plan Apochromat λ DM ×60/ 
1.40 oil objective. Images were projected with maximum projection (3 µm; step 
size, 0.5 µm). 

X-chromosome silencing assay. Cells were stained for Xist RNA, Gpc4 mRNA, 
Atrx mRNA and siRNA-targeted mRNA by FISH and imaged. In addition, in 
some siScramble and siSHARP samples, we used probes against Rbmx, Mecp2, 
Smc1a, Mid1 or Pir mRNA. Images were then analysed using Matlab R2013b 
(described below). Cells were selected if the copy number of the targeted mRNA 
was less than 30% of the level in cells not treated with siRNA and if they induced Xist 
expression. Within these cells, the copy numbers of Gpc4 mRNA and Atrx mRNA 
were quantified using a peak finding method (described below) and compared 
across conditions. We quantified mRNA levels for 50 individual cells. We also 
evaluated Xist expression in siRNA-treated cells, and observed no difference in the 
percentage of cells that induced Xist expression in any of the siRNA conditions 
relative to untreated cells. 
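
The selection rule described above (target mRNA below 30% of the control level, and Xist induced) can be sketched in a few lines. This is only an illustration in Python, not the Matlab code used for the analysis; the record layout `CellCounts`, the function names and the example counts are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class CellCounts:
    """Per-cell smFISH spot counts (hypothetical record layout)."""
    target_mrna: int    # spots for the siRNA-targeted mRNA
    xist_induced: bool  # True if the cell was scored as expressing Xist
    gpc4: int           # spots for Gpc4 mRNA
    atrx: int           # spots for Atrx mRNA

def select_cells(cells, control_target_mean, max_fraction=0.30):
    """Keep cells that induced Xist and whose target mRNA count is below
    max_fraction of the mean count in cells without siRNA treatment."""
    cutoff = max_fraction * control_target_mean
    return [c for c in cells if c.xist_induced and c.target_mrna < cutoff]

def mean_copy_number(cells, gene):
    """Mean spot count of one reporter gene ('gpc4' or 'atrx') over the cells."""
    counts = [getattr(c, gene) for c in cells]
    return sum(counts) / len(counts)

# Made-up example: the control cells average 100 target transcripts.
cells = [
    CellCounts(20, True, 5, 4),    # kept: knocked down and Xist induced
    CellCounts(40, True, 1, 1),    # dropped: knockdown too weak
    CellCounts(10, False, 9, 9),   # dropped: no Xist induction
    CellCounts(29, True, 7, 6),    # kept
]
kept = select_cells(cells, control_target_mean=100)
```

Only the cells that pass both criteria contribute to the Gpc4 and Atrx comparison across conditions.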

Quantifying mRNAs by single molecule FISH. All image analysis was carried 
out using Matlab (version R2013b) using built-in functions from the Image 
Processing toolbox. Images were first filtered using a two-dimensional median 
filter to remove background. Cell boundaries were outlined manually, guided by 
DAPI staining, to create a binary mask and applied to the various channels from 
the same field of view. Top-hat morphological filtering, a background subtraction 
method that enhances the individual focal spots, was applied to the images⁴⁴. The 
spots were then identified using a 2D peak finding algorithm that identifies local 
maximal signals within the cell. Once regional maxima were identified, the num- 
ber of spots was counted for each cell. 
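
The three image-processing steps named above (median filtering, top-hat filtering, and counting local maxima inside the cell mask) can be sketched as follows. This is a minimal pure-Python illustration on nested lists, not the Matlab Image Processing toolbox code used for the analysis. The window sizes, the intensity threshold and the rule that a peak must be a strict local maximum are assumptions, since the text does not specify them.

```python
from statistics import median

def _window(img, r, c, k):
    """Values in the (2k+1) x (2k+1) neighbourhood of (r, c), clipped at the edges."""
    rows, cols = len(img), len(img[0])
    return [img[i][j]
            for i in range(max(0, r - k), min(rows, r + k + 1))
            for j in range(max(0, c - k), min(cols, c + k + 1))]

def _apply(img, k, reduce):
    """Apply a neighbourhood reduction (median, min or max) at every pixel."""
    return [[reduce(_window(img, r, c, k)) for c in range(len(img[0]))]
            for r in range(len(img))]

def median_filter(img, k=1):
    """Two-dimensional median filter, used here to suppress isolated noise."""
    return _apply(img, k, median)

def top_hat(img, k=2):
    """White top-hat: the image minus its opening (erosion followed by dilation).
    Structures smaller than the window survive; smooth background is removed."""
    opened = _apply(_apply(img, k, min), k, max)
    return [[img[r][c] - opened[r][c] for c in range(len(img[0]))]
            for r in range(len(img))]

def count_spots(img, mask, threshold):
    """Count masked pixels above threshold that are strict maxima of their
    3 x 3 neighbourhood (flat plateaus are not counted in this sketch)."""
    count = 0
    for r in range(len(img)):
        for c in range(len(img[0])):
            if not mask[r][c] or img[r][c] <= threshold:
                continue
            neighbours = _window(img, r, c, 1)
            if img[r][c] == max(neighbours) and neighbours.count(img[r][c]) == 1:
                count += 1
    return count
```

Real spots span several pixels, so the median window must be smaller than a spot; a single-pixel signal would be removed by the median step.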

Ezh2 recruitment and Pol II exclusion. Cells were stained for Xist RNA and the 
siRNA-targeted mRNA (FISH) along with EZH2 or Pol II (IF) as described above. 
For image acquisition, the exposure time for each individual channel was kept the 
same across all samples. Images were then analysed, and cells that induced Xist 
and showed knockdown of the target mRNA were selected, as described above. 
Specifically, the nuclei of individual cells were identified manually using the 
DAPI staining. We identified the Xist-coated territory by using an intensity-based 
threshold to partition the image within the nucleus and find contiguous 2-dimen- 
sional regions of high intensity. The threshold was determined based on the Otsu 
method as previously described⁴⁵, which splits the image into 2 bins (high and 
low) and identifies a threshold that minimizes the variance within each bin. 
This creates a binary mask on the image. We visually confirmed that this binary 
mask accurately reflected the Xist-coated territory. We then applied this binary 
mask to the other images in that field of view (Pol II or EZH2). We 
then quantified the intensity of fluorescence signal by taking the average intensity 
of all the pixels within the region (that is, for Pol II or EZH2, respectively). We 
computed this average intensity (1 number per cell) across all conditions and 
compared them using a two-sample unpaired t-test relative to the scramble sample 
across 50 single cells. 
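
The quantification described above can be sketched in Python as follows: an Otsu threshold on the Xist channel, a binary territory mask, the mean Pol II or EZH2 intensity inside the mask, and a two-sample comparison. It is an illustration only, not the authors' Matlab code. The pooled-variance (Student) form of the unpaired t-test is an assumption, and the function names are hypothetical. Only the t statistic and degrees of freedom are computed here; a P value would additionally need the t distribution from a statistics package.

```python
from math import sqrt
from statistics import mean, variance

def otsu_threshold(pixels):
    """Threshold t minimising the summed within-class squared deviation when
    pixels are split into low (<= t) and high (> t) classes."""
    levels = sorted(set(pixels))
    best_t, best_ss = levels[0], float("inf")
    for t in levels[:-1]:  # the top level would leave the high class empty
        low = [p for p in pixels if p <= t]
        high = [p for p in pixels if p > t]
        m_low, m_high = mean(low), mean(high)
        ss = (sum((p - m_low) ** 2 for p in low)
              + sum((p - m_high) ** 2 for p in high))
        if ss < best_ss:
            best_t, best_ss = t, ss
    return best_t

def territory_mask(xist, nucleus):
    """Binary mask of nuclear pixels whose Xist signal exceeds the Otsu threshold."""
    inside = [v for row, nrow in zip(xist, nucleus)
              for v, n in zip(row, nrow) if n]
    t = otsu_threshold(inside)
    return [[n and v > t for v, n in zip(row, nrow)]
            for row, nrow in zip(xist, nucleus)]

def mean_in_mask(signal, mask):
    """Average intensity of another channel over the masked pixels."""
    return mean(v for row, mrow in zip(signal, mask)
                for v, m in zip(row, mrow) if m)

def pooled_t(a, b):
    """Student's two-sample t statistic with pooled variance; returns (t, df)."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    sp2 = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    return (mean(a) - mean(b)) / sqrt(sp2 * (1 / na + 1 / nb)), df
```

In use, `mean_in_mask` yields one number per cell, and the per-cell values for each siRNA condition are compared with those of the scramble control using `pooled_t`.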


33. Wutz, A., Rasmussen, T. P. & Jaenisch, R. Chromosomal silencing and localization 
are mediated by different domains of Xist RNA. Nature Genet. 30, 167-174 (2002). 

34. Engreitz, J. M. et al. RNA-RNA interactions enable specific targeting of noncoding 
RNAs to nascent pre-mRNAs and chromatin sites. Cell 159, 188-199 (2014). 

35. Kalli, A. & Hess, S. Effect of mass spectrometric parameters on peptide and protein 
identification rates for shotgun proteomic experiments on an LTQ-orbitrap mass 
analyzer. Proteomics 12, 21-31 (2012). 

36. Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, 
individualized p.p.b.-range mass accuracies and proteome-wide protein 
quantification. Nature Biotechnol. 26, 1367-1372 (2008). 

37. Cox, J. et al. Andromeda: a peptide search engine integrated into the MaxQuant 
environment. J. Proteome Res. 10, 1794-1805 (2011). 

38. Elias, J. E. & Gygi, S. P. Target-decoy search strategy for mass spectrometry-based 
proteomics. Methods Mol. Biol. 604, 55-71 (2010). 

39. Geiger, T. et al. Use of stable isotope labeling by amino acids in cell culture as a 
spike-in standard in quantitative proteomics. Nature Protoc. 6, 147-157 (2011). 

40. Finn, R. D. et al. Pfam: the protein families database. Nucleic Acids Res. 42, 
D222-D230 (2014). 

41. Jeon, Y. & Lee, J. T. YY1 tethers Xist RNA to the inactive X nucleation center. Cell 
146, 119-133 (2011). 

42. Agrelo, R. et al. SATB1 defines the developmental context for gene silencing by Xist 
in lymphoma and embryonic cells. Dev. Cell 16, 507-516 (2009). 

43. Royce-Tolland, M. E. et al. The A-repeat links ASF/SF2-dependent Xist RNA 
processing with random choice during X inactivation. Nature Struct. Mol. Biol. 17, 
948-954 (2010). 

44. Theodosiou, Z. et al. Automated analysis of FISH and immunohistochemistry 
images: a review. Cytometry A 71, 439-450 (2007). 

45. Fumagalli, M. et al. Telomeric DNA damage is irreparable and causes persistent 
DNA-damage-response activation. Nature Cell Biol. 14, 355-365 (2012). 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


[Extended Data Figure 1 plots. Panel a: Xist and 18S. Panel b: Replicate 1, Replicate 2, Replicate 3. Panel c: x axis, incorporation of labeled amino acids, in bins of <50%, 50-75%, 75-90%, 90-95% and >95%.] 


Extended Data Figure 1 | RAP-MS recovers and enriches the majority of 
Xist RNA from mouse ES cells, and these cells can be efficiently labelled with 
SILAC. a, RT-qPCR measuring the percentage of the total cellular Xist or 18S 
recovered after RAP-MS of Xist. Values are computed as the amount of each 
RNA in the elution divided by the amount of RNA in the starting (‘input’) lysate 
material. Error bars represent the standard error of the mean from 5 biological 
replicates. b, Enrichment of Xist after RAP-MS captures from pSM33 cells as 
measured by qPCR. Each bar represents the RNA level of Xist, 18S, or Oct4 after 
purification of Xist, normalized to RNA in the input sample, from 3 biological 
replicates. c, SILAC labelling efficiency 
of a representative culture of pSM33 mouse ES cells after 10 days of growth 
(3 cell passages) in SILAC medium. Peptides were analysed by mass 
spectrometry, and values indicate the fraction of identified peptides with heavy- 
label incorporation with different levels of peptide labelling (shown in bins). 
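
The two quantities in panel a of this caption, the percentage of RNA recovered (elution divided by input) and the standard error of the mean across replicates, can be computed as in the sketch below. The function names and the replicate values in the usage note are hypothetical and are not data from the study.

```python
from math import sqrt
from statistics import stdev

def percent_recovered(elution_amount, input_amount):
    """Amount of an RNA in the elution as a percentage of its amount in the input lysate."""
    return 100.0 * elution_amount / input_amount

def sem(values):
    """Standard error of the mean: sample standard deviation divided by sqrt(n)."""
    return stdev(values) / sqrt(len(values))
```

For example, with five made-up replicate recoveries, `sem([58.0, 61.0, 60.0, 63.0, 59.0])` would give the half-height of the error bar for that RNA.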

Extended Data Figure 2 | RAP-MS identifies proteins that are known to 
directly interact with specific ncRNAs, and separates specific RNA 
interacting proteins from background proteins. a, SILAC ratios of top 
proteins enriched in the RAP-MS U1 snRNA, 18S rRNA, and 45S pre-rRNA 
experiments. b, SILAC ratio plot of replicate captures of U1 snRNA versus 18S 
rRNA from one of two biologically independent label-swap experiments. 
Proteins associated with U1 are consistently found in U1 samples, both light 
and heavy labelled (top right quadrant), and proteins specifically associated 
with 18S are consistently identified in 18S, both light and heavy (lower left 
quadrant). Background contaminant proteins have low enrichments (centre of 
panel) or are consistently found in the light channel and do not replicate 
between experiments (that is, keratin, streptavidin). c, SILAC ratio plot of 
replicate captures of U1 snRNA versus 45S pre-rRNA from one label-swap 
experiment. Proteins that are known to associate with 45S pre-rRNA are 
consistently identified in 45S captures. 

Extended Data Figure 3 | Immunoprecipitation of the identified Xist- 
interacting proteins confirms Xist RNA interaction. RNA 
immunoprecipitation experiments were performed for seven Xist-interacting 
proteins (black bars), two control RNA binding proteins that were not 
identified by RAP-MS and IgG (grey bars) in UV-crosslinked cell lysate after 
6 h of Xist induction by doxycycline addition (Methods). The RNA associated 
with each protein was measured and enrichment levels were computed relative 
to the level of the RNA in total cellular input and normalized to the total 
efficiency of capture in each sample to allow for direct comparison across all 
immunoprecipitation experiments (Methods). a, Enrichment of the Xist 
lncRNA after immunoprecipitation from a sample of pSM33 male cells. 
b, Immunoprecipitation of SHARP was performed from a sample of UV- 
crosslinked female ES cells that were treated with retinoic acid for 24 h. The 
levels of recovered Xist lncRNA (black bars), Neat1 lncRNA (white bars), and 
45S pre-ribosomal RNA (grey bars) were measured by RT-qPCR. Enrichment 
of each RNA after capture with anti-SHARP antibody was calculated relative to 
the level of RNA captured with IgG control antibody. c, The enrichment of 
various lncRNAs (Neat1, Malat1, Firre, and Tug1) after immunoprecipitation 
in pSM33 male cells is shown. d, The enrichment of various mRNA controls 
(Oct4, Nanog, Stat3, and Suz12) after immunoprecipitation in pSM33 male 
cells is shown. 

Extended Data Figure 4 | Previously identified proteins associated with XCI 
are not required for Xist-mediated transcriptional silencing. a, To confirm 
the specificity of our assay, we tested the function of several proteins that were 
previously identified to associate with Xist, but not to silence transcription, for 
their role in transcriptional silencing in our inducible male ES cells before Xist 
induction (-Dox; left) or after Xist induction for 16 h (+Dox; middle and 
right). Representative images are shown after knockdown of each protein. 
DAPI (blue), Xist (red), and Gpc4 (green). b, Quantification of the copy 
number of Gpc4 before and after Xist induction upon treatment with different 
siRNAs. Error bars represent the standard error of the mean across 50 
individual cells from one experiment. ****P value < 0.001 between +Dox and 
-Dox cells based on an unpaired two-sample t-test. Scale bars on the images 
represent 5 µm. Importantly, while these proteins do not have a role in the 
initiation of transcriptional silencing, we do not mean to imply that they do not 
have other roles in XCI. 

Extended Data Figure 5 | SHARP, LBR, SAF-A, SMRT, and HDAC3 are 
required for Xist-mediated transcriptional silencing. a, Representative 
images showing staining of DAPI (blue), Xist (red), and Gpc4 (green) for 
different siRNA knockdowns in male ES cells before Xist induction (-Dox; left) 
or after Xist induction for 16 h (+Dox; middle and right). b, Quantification of 
the copy number of Gpc4 in -Dox and +Dox cells after knockdown with 
siRNAs targeting different mRNAs. Error bars represent the standard error of 
the mean across 50 individual cells from one experiment. NS, not significantly 
different between +Dox and -Dox cells; ****P value < 0.001 between +Dox 
and -Dox cells based on an unpaired two-sample t-test. Scale bars on the 
images represent 5 µm. c, Knockdown of SHARP, LBR, or SAF-A abrogates 
Xist-mediated gene silencing without causing pluripotency defects. 
Representative images showing staining of Nanog (cyan), Xist (red), and Gpc4 
(green) upon knockdown of SHARP, LBR or SAF-A after 16 h of Xist induction 
with doxycycline. Scale bars on the images represent 5 µm. 

Extended Data Figure 6 | SHARP is required for silencing many genes 
across the X chromosome. a, A diagram showing the locations of Xist (red), 
X-linked silenced genes (black), and X-linked escaped genes (green) along the X 
chromosome. b, Representative images showing staining of DAPI (blue), Xist 
(red), X-linked silenced genes (green), and X-linked escaped genes (yellow) 
upon knockdown of SHARP or in control male ES cells before Xist induction 
(-Dox) or after Xist induction for 16 h (+Dox). Knockdown of SHARP abolishes the 
silencing of Atrx, Gpc4, Rbmx, Smc1a and Mecp2, which are normally silenced 
upon Xist expression, but has no effect on Mid1 and Pir, which normally escape 
Xist-mediated silencing. The bar graphs show the quantification of the copy 
number of the mRNA for each gene for -Dox and +Dox cells upon transfection 
with SHARP siRNA (siSHARP) or control siRNA (siScramble); error bars represent the standard error 
of the mean across 50 individual cells from one experiment. NS, not 
significantly different, ****P value < 0.001, and **P value < 0.01 between 
+Dox and -Dox cells based on an unpaired two-sample t-test. Scale bars on the 
images represent 5 µm. 

Extended Data Figure 7 | Multiple independent siRNAs targeting SHARP, 
LBR, SAF-A, HDAC3, or SMRT demonstrate the same silencing defect. 
a, Representative images showing staining of DAPI (blue), Xist (red), and Gpc4 
(green) after knockdown of proteins using independent, non-overlapping, 
siRNA pools, or individual siRNA deconvoluted from the pool before Xist 
induction (-Dox; left) or after Xist induction for 16 h (+Dox; middle and 
right). Cells were either transfected with the siRNA pool from Dharmacon 
(siRNA-D), Qiagen (siRNA-Q) or Ambion/Life Technologies (siRNA-A), or 
each individual siRNA deconvoluted from the pool from Dharmacon (siRNA- 
D1, 2, 3, 4) or Qiagen (siRNA-Q1, 2, 3, 4). b, Quantification of the copy number 
of Gpc4 in -Dox and +Dox cells after knockdown with siRNAs targeting 
different mRNAs. Error bars represent the standard error of the mean across 50 
individual cells from one experiment. NS, not significantly different between 
+Dox and -Dox cells based on an unpaired two-sample t-test. Scale bars on the 
images represent 5 µm. We excluded all siRNAs that did not reduce the targeted 
mRNA level by >70% (Methods). The sequences of deconvoluted siRNAs are 
shown in Supplementary Table 2. 

Extended Data Figure 8 | SHARP, LBR, SAF-A, SMRT, and HDAC3 are 
required for transcriptional silencing in differentiating female ES cells. 
a, Representative images showing staining of DAPI (blue), Xist (red), and Gpc4 
(green) upon knockdown of specific proteins using different siRNAs in female ES 
cells before differentiation (-RA; left) or after differentiation for 24 h (+RA; 
middle and right). RA, retinoic acid. b, Quantification of the copy number of 
Gpc4 for -RA and +RA cells upon transfection with different siRNAs. Error bars 
represent the standard error of the mean across 50 individual cells from one 
experiment. NS, not significantly different between +RA and -RA cells; ****P 
value < 0.001, **P value < 0.01, and *P value < 0.05 between +RA and -RA 
cells based on an unpaired two-sample t-test. Scale bars on the images represent 
5 µm. 

Extended Data Figure 9 | SHARP is required for exclusion of RNA 
polymerase II from the Xist-coated territory in differentiating female ES 
cells. Images of individual cells that are labelled with Xist (red), RNA 
Polymerase II (green), and DAPI (blue) across different siRNA conditions 
(rows) in female ES cells after 24h of retinoic acid treatment. The dashed white 
region represents the outlined Xist-coated territory. 

Extended Data Figure 10 | SHARP is required for PRC2 recruitment across 
the Xist-coated territory in differentiating female ES cells. Images of 
individual cells that are labelled with Xist (red), Ezh2 (green) and DAPI (blue) 
across different siRNA conditions (rows) in female ES cells after 24h of 
differentiation. The dashed white region represents the outlined Xist-coated 
territory. 




LETTER 


doi:10.1038/nature14185 


Horizontal membrane-intrinsic α-helices in the 
stator a-subunit of an F-type ATP synthase 


Matteo Allegretti¹, Niklas Klusch¹, Deryck J. Mills¹, Janet Vonck¹, Werner Kühlbrandt¹ & Karen M. Davies¹ 


ATP, the universal energy currency of cells, is produced by F-type 
ATP synthases, which are ancient, membrane-bound nanomachines. 
F-type ATP synthases use the energy of a transmembrane electrochemical 
gradient to generate ATP by rotary catalysis. Protons moving 
across the membrane drive a rotor ring composed of 8-15 c-subunits¹. 
A central stalk transmits the rotation of the c-ring to the catalytic F₁ 
head, where a series of conformational changes results in ATP 
synthesis². A key unresolved question in this fundamental process is how 
protons pass through the membrane to drive ATP production. 
Mitochondrial ATP synthases form V-shaped homodimers in cristae 
membranes³. Here we report the structure of a native and active 
mitochondrial ATP synthase dimer, determined by single-particle electron 
cryomicroscopy at 6.2 Å resolution. Our structure shows four 
long, horizontal membrane-intrinsic α-helices in the a-subunit, 
arranged in two hairpins at an angle of approximately 70° relative 
to the c-ring helices. It has been proposed that a strictly conserved 
membrane-embedded arginine in the a-subunit couples proton 
translocation to c-ring rotation⁴. A fit of the conserved carboxy-terminal 
a-subunit sequence places the conserved arginine next to a proton-binding 
c-subunit glutamate. The map shows a slanting solvent-accessible 
channel that extends from the mitochondrial matrix to the 
conserved arginine. Another hydrophilic cavity on the lumenal membrane 
surface defines a direct route for the protons to an essential 
histidine-glutamate pair⁵. Our results provide unique new insights 
into the structure and function of rotary ATP synthases and explain 
how ATP production is coupled to proton translocation. 
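
As a worked illustration of rotary catalysis, the arithmetic linking c-ring size to proton cost can be sketched as below. Two assumptions that are not stated in this text are used: each c-subunit carries one proton per full turn of the ring, and each full turn yields three ATP (one per catalytic site of the F₁ head). The function names are hypothetical.

```python
def step_angle_deg(n_c_subunits):
    """Rotation of the c-ring per translocated proton, in degrees."""
    return 360.0 / n_c_subunits

def protons_per_atp(n_c_subunits, atp_per_turn=3):
    """Protons translocated per ATP, assuming one proton per c-subunit per turn
    and atp_per_turn ATP molecules per full rotation (assumed value: 3)."""
    return n_c_subunits / atp_per_turn
```

Under these assumptions, a ring of ten c-subunits turns by 36° per proton and needs ten protons for every three ATP, while rings of 8 to 15 subunits span roughly 2.7 to 5 protons per ATP.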

To investigate the structural basis of proton translocation in F-type 
ATP synthases, we performed single-particle electron cryomicroscopy 
on detergent-solubilized mitochondrial ATP synthase dimers of the colourless 
green alga Polytomella sp. Dimers of Polytomella ATP synthase 
have a molecular mass of approximately 1.6 MDa and are particularly 
stable owing to their bulky peripheral stalk formed by the ATP synthase-associated 
subunits, ASA1-9 (ref. 6). The complex used for electron 
cryomicroscopy data collection had high oligomycin-sensitive ATPase 
activity (Extended Data Fig. 1). A total of 37,238 dimer images were combined 
to generate a map with an overall resolution of 7.0 Å (Fig. 1, Extended 
Data Fig. 2 and Supplementary Video 1). Subvolume masking 
indicated a resolution of 6.2 Å for the rigid peripheral stalk and its 
associated membrane domains (Extended Data Fig. 3), whereas the 
resolution for the more flexible catalytic F₁ subcomplex and c-ring was 
7.4 Å. α-Helices were clearly resolved in all parts of the symmetrized 
and unsymmetrized map (Extended Data Fig. 4). 

The ATP synthase dimer has a twofold symmetrical V-shape. Atomic 
models of homologous F-type ATP synthase subcomplexes⁷,⁸ were easily 
fitted into the map (Fig. 1a). The peripheral stalks form a solid scaffold 
of multiply-entwined, long α-helices. Cross-sections through the 
membrane-embedded part of the dimer reveal six transmembrane helices 
and a rotor ring of ten c-subunits, each forming a helix hairpin perpendicular 
to the membrane plane (Fig. 1b, c). The dimer interface extends 
from the peripheral stalk region through the hydrophobic membrane 
core to the lumenal membrane surface. About 80 Å above the membrane 
surface, a helix-turn-helix motif protrudes from the peripheral 
stalk to interact with its counterpart in the other protomer (Fig. 1d). On 
the matrix membrane surface, a structural motif resembling Armadillo 
repeat proteins⁹ bridges the two peripheral stalks (Fig. 1e). In the 
membrane, two transmembrane helices from each protomer line the 
dimer interface (Fig. 1c). On the lumenal surface, the ends of three 
transmembrane helices interact to connect the two protomers 
(Supplementary Video 1). 

Figure 1 | Electron cryomicroscopy map of the Polytomella F-type ATP 
synthase dimer. a, Side view. One half is fitted with atomic models of the F₁ 
subcomplex (α-subunits, cyan; β-subunits, blue; γδε-subunits, grey; 
oligomycin-sensitivity-conferring protein, green; Protein Data Bank 
accession number 2WSS⁷) and the c-ring (yellow, c₁₀-ring; Protein Data Bank 
accession number 3U2Y⁸). Density threshold levels are 1σ (mesh) or 7σ 
(solid surface). b, c, Cross-sections through the membrane domains as 
indicated in a, showing the c₁₀ rotor ring surrounded by detergent (diffuse grey 
density in b) and six transmembrane helices per protomer. c, Horizontal 
a-subunit helices are visible as elongated white densities. d, e, Helix-turn-helix 
motif (blue) and Armadillo repeat-like density (pink) connecting the two 
protomers, viewed in the direction of the arrowheads in a. Scale bar, 100 Å. 
See also Supplementary Video 1. 

Figure 2 | a-Subunit structure. a, Membrane-intrinsic helix hairpins of 
subunit a (blue) and c₁₀-rotor ring (yellow) viewed from the dimer interface 
(see Supplementary Video 2). b, Matrix view of the c-ring (yellow; fitted atomic 
model, Protein Data Bank accession number 3U2Y⁸) and C-terminal hairpin 
of subunit a with fitted helices. Blue and red dotted circles indicate the 
approximate positions of interchangeable residue pairs aR239/aQ295 (ref. 17) 
(blue) and aE288/aH248 (ref. 5) (green). Red ribbon, prolines at helix ends. 
The dashed grey outline marks the aqueous matrix cavity. c, Orthogonal section 
through c-ring and the a-subunit four-helix bundle along the dashed line 
in b. In its proposed position, the strictly conserved aR239 (dark blue) interacts 
with proton-binding glutamates of the outer c-ring helices (red). Scale bar, 10 Å. 

Between each c-ring and the six transmembrane helices, the map 
reveals a bundle of four long membrane-intrinsic helices arranged in 
two hairpins, almost at right angles to the c-ring (Figs 1b, c and 2 and 
Supplementary Video 2). The longest (~80 Å) membrane-intrinsic helix 
is embedded mostly in the lumenal membrane leaflet, where it bends 
around the c-ring for more than one-third of its length. The second 
helix in this hairpin is ~65 Å long, roughly straight and connected to 
the curved long helix via a tight loop in the lumenal leaflet (Fig. 2a). 
Both helices are in contact with the c-ring. The other, shorter (~35 Å) 
membrane-intrinsic hairpin is located between the six transmembrane 
helices and the longer helix hairpin, and is continuous with a helix density 
that extends into the peripheral stalk on the matrix side (Fig. 2a). 
In Escherichia coli, the a-subunit can be multiply crosslinked to the 
c-ring¹⁰,¹¹. Since the four horizontal helices are the only protein density 
next to the c-ring in our structure, we assign them to subunit a. 

The horizontal orientation of the membrane-intrinsic a-subunit helices 
was unexpected. Previous electron cryomicroscopy studies¹²,¹³ and 
chemical crosslinking’ had suggested four or more vertical helices in 
the a-subunit. However, our 6.2 Å map of the Polytomella dimer is 
unambiguous in showing that the a-subunit helices are oriented horizontally 
(see Supplementary Discussion and Extended Data Fig. 5). 

A comparison of a-subunit sequences from bacteria, mitochondria 
and chloroplasts indicates a conserved carboxy (C)-terminal segment of 
90-100 residues (Extended Data Fig. 6), while the remainder is highly 
divergent. The C-terminal segment includes a strictly conserved arginine 
(aR239) that is essential for coupled proton transfer in E. coli⁴. Several 
residues in this segment can be crosslinked to the c-ring in E. coli¹⁰,¹¹. 
We conclude that the long hairpin contains the conserved arginine and 
the residues that crosslink to the c-ring (Extended Data Fig. 7). We 
assign the free end of the curved helix (Fig. 2a) to the C terminus of subunit 
a, because the other helix in this hairpin is connected to protein 
density of the peripheral stalk at a lower contour level. 

The two long helices in the C-terminal hairpin comprise about 40 and 
50 residues, respectively. Examining the sequence of the Polytomella 
mitochondrial a-subunit, we detected two largely hydrophobic stretches 
of 41 and 47 residues, each with a proline at either end (Extended Data 
Fig. 6). Proline residues are often found at the start or end of α-helices”. 
We fitted α-helices of the corresponding sequences to the long 
C-terminal hairpin (Fig. 2b). Similar, proline-delineated stretches are 
present in all compared a-subunit sequences (Extended Data Fig. 6), 
suggesting that a horizontal helix hairpin next to the c-ring is a feature 
common to all F-type ATP synthases. 
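
The search for proline-delineated hydrophobic stretches described above could be automated along the following lines. This is a hypothetical sketch, not the procedure used in the study: the set of residues counted as hydrophobic, the length window and the minimum hydrophobic fraction are all assumptions chosen for illustration.

```python
# Residues counted as hydrophobic in this sketch (an assumed set).
HYDROPHOBIC = set("AVLIMFWGC")

def proline_delimited_stretches(seq, min_len=35, max_len=55, min_fraction=0.6):
    """Return (start, end) index pairs, 0-based and inclusive, of segments that
    begin and end with proline, have a length within [min_len, max_len], and
    contain at least min_fraction hydrophobic residues."""
    prolines = [i for i, aa in enumerate(seq) if aa == "P"]
    hits = []
    for a in prolines:
        for b in prolines:
            length = b - a + 1
            if b <= a or not (min_len <= length <= max_len):
                continue
            segment = seq[a:b + 1]
            fraction = sum(aa in HYDROPHOBIC for aa in segment) / length
            if fraction >= min_fraction:
                hits.append((a, b))
    return hits

# Toy sequence (not a real a-subunit): one 12-residue stretch between prolines.
toy = "KKP" + "LIVAF" * 2 + "PEEDKP"
stretches = proline_delimited_stretches(toy, min_len=10, max_len=15, min_fraction=0.8)
```

Run over aligned a-subunit sequences, a scan of this kind would flag candidate helix segments for comparison with the densities in the map.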

Our fit places the essential aR239 next to the proton-binding c-subunit 
glutamate”’, at the point of closest proximity between the c-ring and 
a-subunit helices (Fig. 2b, c). In E. coli, this arginine can be exchanged 
against glutamine with 20% retention of activity, provided that a con- 
served glutamine in the second helical stretch is in turn replaced by 
arginine¹⁷. In Polytomella, the conserved glutamine is aQ295. Our fit 
places this glutamine in the long curved helix directly below the pro- 
posed position of aR239 (Fig. 2b). In E. coli, the equivalents of aE248 
and aH288 form another essential pair of conserved, mutually interchangeable 
residues⁵. In our fit, aE248 and aH288 are likewise directly 
above one another in the long helix hairpin (Fig. 2b). Our model 
explains why these two residue pairs are interchangeable, and how an 
arginine in place of aQ295 can substitute for aR239 in coupled proton 
translocation. 

On the basis of mutagenesis studies in E. coli, proton translocation 
through subunit a has been proposed to involve two aqueous half channels¹⁸. 
The ends of the two long hairpin helices line a slanting aqueous 
cavity on the matrix side of the detergent shell. The cavity extends down 
to the curved hairpin helix and narrows to a slit at the point where the a- 
and c-subunits interact in the middle of the membrane (Figs 2a, b and 
3a, c and Supplementary Video 3). The aqueous cavity partly exposes 
the matrix ends of three c-subunit helices (Fig. 3a). Protons carried into 
this cavity by the c-subunit glutamates would be rapidly lost to the high 
pH of the matrix solvent’’. In our model, the matrix cavity is lined by 
the amino (N)-terminal part of the long straight helix (residues 225- 
235), which contains conserved polar or charged residues for three helix 
turns upstream of aR239 (Extended Data Fig. 6). This helix segment is 
therefore partly hydrophilic, or amphipathic. Cysteine mutation exper- 
iments in E. coli indicated that residues in this region, as well as residues 
44-66 in the E. coli c-ring, are solvent-exposed'*”°”’, in excellent agree- 
ment with our model (Supplementary Video 4). On the lumenal mem- 
brane surface, we find a further cavity that extends from the cristae 
lumen to the site of the essential aH248/aE296 pair of interchangeable 
residues (Fig. 3b, d and Supplementary Video 3). The corresponding 
residues in E. coli have been shown to be ion-accessible’*. We conclude 
that this lumenal cavity is the entrance channel for protons crossing 
the inner mitochondrial membrane. 

Figure 3 | Two aqueous half-channels. a, Left: side view of membrane domain 
in one protomer. Light blue, a-subunit helices; yellow, c-ring. The aqueous 
matrix cavity (boxed) is lined by amphipathic segments of the long hairpin 
helices and exposes the matrix ends of three c-subunits. Right: the aqueous 
matrix cavity narrows to a slit at the point where the a- and c-subunits interact 
(dark blue). b, Left: lumenal view of membrane region with aqueous cavity 
(boxed). Right: lumenal cavity with proposed position of interchangeable 
residue pair aE288/aH248 (green). Cross-sections along white lines in 
b, through the aqueous cavities on the matrix (c) and lumenal (d) side. 
Red arrows mark the proton paths into the lumenal (d) and out of the 
matrix (c) half-channels. Pink, detergent shell. Scale bar, 10 Å. See also 
Supplementary Video 3. 


Structures of c-rings and molecular dynamics simulations suggest 
that the transition from a hydrophobic to a hydrophilic environment is 
sufficient to convert the proton-binding glutamic acid of the c-ring from 
a locked, protonated state to an open conformation, where the proton 
can be exchanged”””. In ATP synthesis, the c-ring rotates anticlockwise 
as seen from the matrix. This means that a protonated c-ring glutamate 
encounters the C-terminal part of the longest a-subunit helix before the 
solvent channel. All compared C-terminal a-subunit sequences contain 
polar and charged residues in this region, which may locally disrupt the 
matrix membrane leaflet, forming the aqueous matrix cavity. The hy- 
drophilic environment in this cavity would destabilize the locked con- 
formation, and thus promote the release of the proton into the matrix. 

Figure 4 | Proton translocation through F-type ATP synthases. Protons 
(red) reach the conserved glutamate in the c-subunit via the aqueous lumenal 
half-channel (dashed grey circle). The proton competes with the strictly 
conserved aR239 (blue) for interaction with c-ring glutamates, which carry the 
proton around the c-ring. When the c-subunit approaches the hydrophilic 
half-channel on the matrix side (dashed grey outline), the glutamate becomes 
hydrated and adopts an open conformation, from which the proton can escape 
into the matrix. Solid grey circles indicate transmembrane helices. 

Our model explains the direction in which the c-ring rotates under 
ATP synthesis conditions (Fig. 4). Thermal motion causes the ring to 
move backwards and forwards randomly as a Brownian ratchet”. A 
c-subunit glutamate accepts a proton through the lumenal half-channel, 
where the proton concentration is higher than on the matrix side. The 
protonated glutamate cannot pass the conserved arginine*** but instead 
partitions into the hydrophobic environment of the lipid bilayer in a 
locked conformation””’. On the other side of the membrane, the polar 
environment of the matrix cavity hydrates another protonated c-ring 
glutamate, which adopts an open conformation, so that the proton can 
escape to the high-pH matrix solvent. The de-protonated glutamate 
cannot partition back into the lipid phase but passes the conserved argi- 
nine to become re-protonated. Because of the relative locations of the 
two half-channels, the conserved arginine and the direction of the elec- 
trochemical gradient, the ring can rotate only one way, unless energy 
released by ATP hydrolysis drives it in the opposite direction. By this 
mechanism, the electrochemical membrane gradient is converted into 
the torque that drives ATP synthesis in mitochondria, bacteria and 
chloroplasts. 
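
The ratchet argument above can be illustrated with a deliberately simple toy model. This is only a sketch under stated assumptions, not the authors' simulation: thermal motion proposes forward and backward steps with equal probability, a forward step is accepted only if the glutamate leaving the lumenal half-channel is protonated (hypothetical probability `p_lumen`), and a backward step only if the glutamate leaving the matrix half-channel is still protonated (hypothetical probability `p_matrix`). The function name and all parameter values are illustrative.

```python
import random


def simulate_ratchet(steps, p_lumen, p_matrix, seed=0):
    """Return net c-subunit steps in the synthesis direction (toy model).

    Assumptions (hypothetical, for illustration only):
    - each thermal fluctuation proposes a forward or backward step
      with equal probability;
    - a forward step needs a protonated glutamate entering the lipid
      from the lumenal half-channel (probability p_lumen);
    - a backward step needs a protonated glutamate entering the lipid
      from the matrix half-channel (probability p_matrix).
    """
    rng = random.Random(seed)
    position = 0
    for _ in range(steps):
        if rng.random() < 0.5:
            # proposed forward step
            if rng.random() < p_lumen:
                position += 1
        elif rng.random() < p_matrix:
            # proposed backward step
            position -= 1
    return position


if __name__ == "__main__":
    # Higher proton concentration on the lumenal side gives net forward rotation.
    print(simulate_ratchet(10_000, p_lumen=0.9, p_matrix=0.1))
    # Without a gradient the ring only diffuses.
    print(simulate_ratchet(10_000, p_lumen=0.5, p_matrix=0.5))
```

In this sketch the random motion itself is unbiased; the net direction comes only from the asymmetry in protonation probability between the two half-channels, which is the essence of the argument in the text.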


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 

Isolation of mitochondria. Cultures of Polytomella sp. (198.80, E. G. Pringsheim) 
were obtained from the Göttingen Collection of Algal Cultures (Sammlung 
Algenkulturen Göttingen (SAG), University of Göttingen, Germany) and grown 
at room temperature (23 ± 2 °C) without agitation in MAP medium” with the 
addition of trace elements and vitamins B1 and B12 (ref. 25). Cultures were made 
axenic by serial dilution and the addition of carbenicillin (100 µg ml⁻¹). To isolate 
mitochondria, cells were harvested during their exponential growth phase (4 °C, 
15 min, 10,500g), resuspended in a minimal volume of isolation buffer (350 mM 
sucrose, 20 mM MOPS/NaOH pH 7.4, 1 mM EGTA) supplemented with 0.3% (w/v) 
fat-free BSA and protease inhibitors (Mini cOmplete Protease Inhibitor Cocktail, 
Roche) and lysed by five passes through a ball-bearing homogenizer (18 µm 
clearance, Isobiotec). The cell lysate was clarified by differential centrifugation, 
first at 2,000g to remove cell debris and then at 10,000g to pellet the mitochondria. 
Mitochondria were resuspended in minimal isolation buffer without BSA, frozen in 
liquid nitrogen and stored at −80 °C for later use. 

Purification of ATP synthase dimers. Polytomella ATP synthase dimers were 
purified according to a published procedure”® with modifications. Mitochondrial 
membranes (50 mg) were washed in buffer A (50 mM Tris-HCl, pH 8.0, 1 mM MgCl2) 
and resuspended in solubilization buffer (50 mM Tris-HCl pH 8.0, 1 mM MgCl2, 
3.33% (w/v) n-dodecyl-β-D-maltoside (DDM)) to a total volume of 1.5 ml. After 
30 min incubation at 4 °C, membranes were removed by centrifugation and the 
supernatant was loaded onto a diethylaminoethyl (DEAE) anion exchange column 
(bed volume, 2.5 ml, DEAE-Biogel A, Biorad) equilibrated in buffer B (50 mM 
Tris-HCl, pH 8.0, 1 mM MgCl2, 0.05% (w/v) DDM). The column was washed with 
buffer B-20 (buffer B with 20 mM NaCl) and the ATP synthase dimers were eluted 
with buffer B-55 (buffer B with 55 mM NaCl). The eluate was concentrated to 50 µl 
(Vivaspin 500 maximum spin speed columns with 50,000 molecular mass cutoff) 
and loaded onto a Superose 6 gel-filtration column (PC 3.2/30) equilibrated in 
buffer B-20 on an Ettan purifier (GE Healthcare). Fractions eluting at retention 
volume 1.05-1.15 ml were collected and analysed for ATPase activity”. 

Electron microscopy. A 3 µl aliquot of a 0.5 mg ml⁻¹ ATP synthase dimer solution 
in 0.05% (w/v) DDM showing oligomycin- and DCCD-sensitive ATPase activity 
was applied to R2/2 Quantifoil grids without carbon backing, and plunge-frozen in 
a Vitrobot (FEI) at 75% humidity, 10 °C with 9 s blotting time. Dose-fractionated 1.5 s 
movies of 24 frames with an electron dose of three electrons per square ångström 
per frame were recorded after coma-free alignment”® with an FEI Polara electron 
microscope at 300 kV and 1.5-5 µm defocus on a back-thinned Falcon-II direct 
electron detector (FEI) as described”’. Images were collected at a nominal 
magnification of ×59,000, corresponding to a specimen pixel size of 1.77 Å. 
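
As a quick consistency check on these acquisition parameters, the following sketch derives the accumulated dose, the per-frame exposure time and the Nyquist limit from the values stated above. The constant and function names are illustrative; the only inputs are the numbers given in the Methods (24 frames, 1.5 s, three electrons per square ångström per frame, 1.77 Å pixel size).

```python
FRAMES_PER_MOVIE = 24
EXPOSURE_S = 1.5
DOSE_PER_FRAME = 3.0   # electrons per square angstrom per frame
PIXEL_SIZE_A = 1.77    # specimen pixel size in angstrom


def total_dose(n_frames, dose_per_frame=DOSE_PER_FRAME):
    """Accumulated dose (electrons per square angstrom) over n_frames."""
    return n_frames * dose_per_frame


def nyquist_limit(pixel_size=PIXEL_SIZE_A):
    """Finest resolvable spacing (angstrom) for a given pixel size."""
    return 2.0 * pixel_size


# Exposure time per frame in milliseconds.
frame_time_ms = 1000.0 * EXPOSURE_S / FRAMES_PER_MOVIE

if __name__ == "__main__":
    print("dose, full movie:", total_dose(FRAMES_PER_MOVIE), "e/A^2")
    print("dose, 15 frames:", total_dose(15), "e/A^2")
    print("frame time:", frame_time_ms, "ms")
    print("Nyquist limit:", nyquist_limit(), "A")
```

The Nyquist limit of about 3.5 Å sits comfortably beyond the 6-8 Å resolutions reported for the maps, so sampling is not the limiting factor at this pixel size.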

Image processing and map interpretation. Images without obvious drift were 
selected for further processing. Global beam-induced motion of image frames was 
corrected”. The contrast transfer function for each image was determined using 
CTFFIND3 (ref. 31) within the RELION-1.3 workflow”? and 49,600 particles were 
picked manually using EMAN boxer™. 

A sub-tomogram average of the Polytomella ATP synthase dimer, obtained from 
tomographic volumes of mitochondrial membranes, was used as an initial refer- 
ence for refinement in RELION-1.3. The handedness of the sub-tomogram average 
was determined and confirmed by comparison with known X-ray structures of ATP 
synthase subcomplexes”. 

Two-dimensional classification of the picked particles was performed in RELION- 
1.3. Three-dimensional classification was performed to identify structural hetero- 
geneity. Three out of the five classes obtained were virtually identical and showed 
well-resolved structural features. A total of 37,238 particles belonging to these three 
classes were combined for further processing. Individual frames were B-factor 
weighted and movements of individual particles were reversed by movie frame cor- 
rection in RELION-1.3 (ref. 36). The final c2-averaged volume was calculated from 
particles in frames 1-15. A B-factor of −200 Å² for map sharpening was determined 
using the modulation transfer function of the Falcon II detector’’. 

Resolution anisotropy was assessed using the local resolution estimation 
programme ResMap* (Extended Data Fig. 3a). Gold-standard Fourier shell correlations 
were calculated from two independently refined data sets to determine the resolution 
of the complete volume and sub-areas of the structure. For the latter, different 
regions of two independently refined maps were soft-masked and the resolution 
determined by Fourier shell correlations using EMAN2 (ref. 39) (Extended Data 
Fig. 3b). 
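
To make the resolution read-out concrete, the sketch below finds where a Fourier shell correlation curve first falls below a threshold and reports the corresponding resolution. It assumes the 0.143 criterion that is commonly paired with gold-standard FSC; the Methods do not state the threshold, and the curve values in the example are invented for illustration, not data from this study.

```python
def resolution_at_threshold(freqs, fsc, threshold=0.143):
    """Return the resolution (1/frequency) where the FSC first drops below threshold.

    freqs: spatial frequencies in 1/angstrom, in increasing order
    fsc:   correlation values, one per shell
    Returns None if the curve never crosses the threshold.
    """
    for i in range(1, len(fsc)):
        if fsc[i - 1] >= threshold > fsc[i]:
            # Linear interpolation between the two neighbouring shells.
            frac = (fsc[i - 1] - threshold) / (fsc[i - 1] - fsc[i])
            crossing = freqs[i - 1] + frac * (freqs[i] - freqs[i - 1])
            return 1.0 / crossing
    return None


if __name__ == "__main__":
    # Hypothetical FSC curve, for illustration only.
    example_freqs = [0.05, 0.10, 0.15, 0.20]
    example_fsc = [1.0, 0.9, 0.243, 0.043]
    print(resolution_at_threshold(example_freqs, example_fsc))
```

The same function can be applied to curves from differently masked sub-volumes to compare regional resolution, as described for the soft-masked maps above.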

Sequences were aligned with ClustalW2 (ref. 40). Helices were fitted with DireX"’ 
and atomic models with Chimera”. 

Extended Data Figure 1 | Oligomycin-sensitive ATPase activity. ATPase 
assays”’ performed with the Polytomella ATP synthase dimers used for electron 
cryomicroscopy data collection indicated high activity of 6-8 units per 
milligram protein, close to the reported activity for this complex”. ATPase 
activity was ~90% inhibited by oligomycin, indicating that the F1Fo complex 
is coupled. The vertical lines indicate addition of dodecyl maltoside detergent 
to initiate the reaction. Measurements were performed in triplicates for 
each purification. 

Extended Data Figure 2 | Projection images of the Polytomella F-type ATP 
synthase dimer. a, Typical electron cryo-micrograph of dimers in vitrified 
buffer recorded at 2.5 µm underfocus. b, Two-dimensional class averages and 
(c) corresponding projection images calculated from the final three- 
dimensional volume. d, Angular distribution of projection images used for 
three-dimensional reconstruction. ATP synthase dimers are oriented 
randomly in the thin layer of vitrified buffer. Scale bar in a, 50 nm. 

Extended Data Figure 3 | Local resolution estimates. a, The unfiltered 
electron cryomicroscopy map was analysed with the programme ResMap™ to 
assess local resolution. The catalytic F1 head (red to green) is less well-ordered 
than the peripheral stalk and the membrane-embedded a-subunit (green to 
blue), as indicated by the rainbow colour code on the left. The insets show 
cross-sections through the density at the levels indicated on the right. b, Fourier 
shell correlation curves calculated from two independently refined data 
sets after soft masking. The resolution was 7.0 Å for the whole complex (red), 
7.4 Å for the F1/c-ring complex (blue mask and curve) and 6.2 Å for the 
peripheral stalk and a-subunit (yellow mask and curve). 

Extended Data Figure 4 | Resolution of unsymmetrized map. Side view (a) 
and top view (b) of the dimer map refined without imposing c2 symmetry 
(blue). c, The gold-standard Fourier shell correlation curve indicates a 
resolution of 8.0 Å for the unsymmetrized map, which shows all essential 
features of the Polytomella dimer, including c2 symmetry. 

Extended Data Figure 5 | Horizontal helices in the three-dimensional map 
and in two-dimensional projection. a, Section of the three-dimensional 
map showing a side view of the c10-ring on the left and the a-subunit on the 
right. b, A slice of the three-dimensional map volume indicated in a shows the 
horizontal helices clearly. c, In a two-dimensional map generated by projecting 
the three-dimensional map volume in a along the indicated direction (arrow), 
the long horizontal helices are in effect invisible, whereas the vertical 
c-ring helices stand out clearly. Scale bar, 10 Å. 

Extended Data Figure 6 | Sequence alignment of about 120 C-terminal 
subunit a residues. Sequences of bacterial (blue), mitochondrial (red) and 
chloroplast (green) F-type ATP synthases are compared. The interchangeable 
residue pairs aR239/aQ295 and aE288/aH248 are dark blue and green, 
respectively; sequences of helices fitted in Fig. 2b are light blue. The black 
arrowhead marks the strictly conserved arginine essential for coupled proton 
translocation. Red, prolines; magenta, solvent-exposed residues'*; green, 
residues that crosslink to the c-ring’*”’. Identical (*) or similar (':' or '.') 
residues are indicated. 

E. coli residues    Cβ distance in model (Å)    Polytomella residues 
a207/c55            10                          a236/c53 
a214/c62             9                          a243/c60 
a214/c65            10                          a243/c63 
a221/c69            14                          a250/c67 
a223/c72            16                          a252/c70 
a224/c73            14                          a253/c71 
a225/c73            19                          a254/c71 

E. coli residues    Cβ distance in model (Å)    Polytomella residues 
a263/c54            15                          a306/c52 
a262/c53            23                          a305/c51 
a262/c54            18                          a305/c52 
a262/c55            17                          a305/c53 
a252/c62             8                          a295/c60 
a252/c65             9                          a295/c63 
a251/c57            21                          a294/c55 
a251/c59            17                          a294/c57 
a251/c62            13                          a294/c60 
a248/c62            16                          a291/c60 
a248/c63            12                          a291/c61 
a248/c65            10                          a291/c63 

Extended Data Figure 7 | Subunit a and c crosslinking distances. Pairs of 
a- and c-subunit residues crosslinked in E. coli using (a) zero-length 
crosslinkers'® and (b) bis-MTS reagents''. Tabulated distances were measured 
between beta carbons of residues in the modelled a-subunit helices and the 
Saccharomyces c-ring helices fitted to the map. Zero-length crosslinks can be 
considerably shorter than crystallographic distances", most probably as a 
result of thermal motion during the long incubation times required for 
crosslinking'®. Nevertheless, the inter-residue distances in our model agree with 
E. coli crosslinking data within reasonable margins, allowing for species 
differences between bacterial and mitochondrial ATP synthases. 
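
The tabulated residue pairs imply a fixed numbering offset between the E. coli and Polytomella sequences within each table. The sketch below recovers those offsets from the rows that are legible without ambiguity in the tables above; the variable and function names are illustrative, and only a subset of rows is used.

```python
# Each entry: ((E. coli a residue, E. coli c residue),
#              (Polytomella a residue, Polytomella c residue)).
# Rows taken from the first and second tables of Extended Data Fig. 7.
FIRST_TABLE = [
    ((207, 55), (236, 53)),
    ((214, 62), (243, 60)),
    ((223, 72), (252, 70)),
    ((224, 73), (253, 71)),
    ((225, 73), (254, 71)),
]
SECOND_TABLE = [
    ((263, 54), (306, 52)),
    ((262, 53), (305, 51)),
    ((252, 62), (295, 60)),
    ((251, 57), (294, 55)),
    ((248, 63), (291, 61)),
]


def numbering_offsets(pairs):
    """Return the sets of (Polytomella - E. coli) offsets for subunits a and c."""
    a_offsets = {poly_a - ecoli_a for (ecoli_a, _), (poly_a, _) in pairs}
    c_offsets = {poly_c - ecoli_c for (_, ecoli_c), (_, poly_c) in pairs}
    return a_offsets, c_offsets


if __name__ == "__main__":
    print(numbering_offsets(FIRST_TABLE))
    print(numbering_offsets(SECOND_TABLE))
```

A single offset per table for each subunit is what a gap-free local alignment would give. The a-subunit offset differs between the two tables, which would be consistent with an insertion in the Polytomella sequence between the two regions, although the tables alone do not establish this.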

Electron cryomicroscopy observation of rotational 
states in a eukaryotic V-ATPase 


Jianhua Zhao1,2*, Samir Benlekbir1* & John L. Rubinstein1,2,3 


Eukaryotic vacuolar H+-ATPases (V-ATPases) are rotary enzymes 
that use energy from hydrolysis of ATP to ADP to pump protons 
across membranes and control the pH of many intracellular 
compartments. ATP hydrolysis in the soluble catalytic region of the 
enzyme is coupled to proton translocation through the membrane-bound 
region by rotation of a central rotor subcomplex, with peripheral 
stalks preventing the entire membrane-bound region from 
turning with the rotor. The eukaryotic V-ATPase is the most complex 
rotary ATPase: it has three peripheral stalks, a hetero-oligomeric 
proton-conducting proteolipid ring, several subunits not found in 
other rotary ATPases, and is regulated by reversible dissociation 
of its catalytic and proton-conducting regions’”. Studies of ATP 
synthases, V-ATPases, and bacterial/archaeal V/A-ATPases have 
suggested that flexibility is necessary for the catalytic mechanism 
of rotary ATPases*®, but the structures of different rotational states 
have never been observed experimentally. Here we use electron 
cryomicroscopy to obtain structures for three rotational states of the 
V-ATPase from the yeast Saccharomyces cerevisiae. The resulting 
series of structures shows ten proteolipid subunits in the c-ring, 
setting the ATP:H+ ratio for proton pumping by the V-ATPase at 
3:10, and reveals long and highly tilted transmembrane α-helices in 
the a-subunit that interact with the c-ring. The three different maps 
reveal the conformational changes that occur to couple rotation in 
the symmetry-mismatched soluble catalytic region to the 
membrane-bound proton-translocating region. Almost all of the subunits 
of the enzyme undergo conformational changes during the transitions 
between these three rotational states. The structures of these states 
provide direct evidence that deformation during rotation enables the 
smooth transmission of power through rotary ATPases. 

The V-ATPase from S. cerevisiae consists of subunits 
A3B3CDE3FG3Hacxc′yc″zde (Extended Data Fig. 1), where x, y, and z denote 
unknown stoichiometries. In the soluble V1 region, the three pairs of 
A- and B-subunits are arranged around the D-subunit of the central 
rotor. Each AB pair contains a single catalytic nucleotide-binding site 
and together the three AB pairs produce a pseudo-symmetric trimer of 
heterodimers, with each AB pair in a different conformation’. The 
three main conformations of the AB pairs are known as ‘tight’, ‘loose’, 
and ‘open’. These conformations are expected to bind ATP, ADP and 
phosphate, and no nucleotide, respectively, although their nucleotide 
content in crystal structures of catalytic regions from other rotary 
ATPases has depended on the crystallization conditions’*. During 
catalysis, the inter-conversion between the catalytic conformations 
of the three AB pairs is coupled to rotation of the central rotor, which 
extends into the membrane via subunit d to the cxc′yc″z-ring. This 
rotary catalytic mechanism suggests that a population of intact 
V-ATPase complexes will be found in three or more structurally distinct 
rotational states. To investigate the rotational states of the eukaryotic 
V-ATPase, we isolated the complex from S. cerevisiae in detergent and 
imaged it by electron cryomicroscopy (cryo-EM) with a direct detector 
device camera (Extended Data Fig. 2). Classification of V-ATPase 
images enabled the identification of three distinct three-dimensional 
classes of varying size that gave maps at resolutions of 6.9 Å (47% of 
images), 7.6 Å (36% of images), and 8.3 Å (17% of images) (Extended 
Data Fig. 3a, b), with 106,445 particle images ultimately contributing to 
the three maps. Cross-sections through the maps show that α-helices 
are well resolved at these resolutions (Fig. 1 and Supplementary Video 
1). The class with the largest population most closely corresponds to an 
earlier cryo-EM map of the S. cerevisiae V-ATPase where conformational 
separation was not performed” but differs from the conformation 
identified for the Manduca sexta V-ATPase”. 

Figure 1 | Rotational states of the V-ATPase. a, Cross-sections through a 
three-dimensional map of rotational state 1 of the V-ATPase V1 region (upper) 
and Vo region (lower) show that α-helices can be resolved. The best resolution 
in the Vo region is indicated with the pink arrow. b, The three different maps 
each allow docking of atomic models of subunits. The experimental map is 
shown in semi-transparent grey. The blue stars indicate the central rotor. Scale 
bar, 25 Å. Panel labels: State 1 (47% of images); State 2 (36% of images); 
State 3 (17% of images). 

1Molecular Structure and Function Program, The Hospital for Sick Children Research Institute, 686 Bay Street, Toronto, Ontario M5G 0A4, Canada. 2Department of Medical Biophysics, The University of 
Toronto, Toronto Medical Discovery Tower, MaRS Centre, 101 College Street, Toronto, Ontario M5G 1L7, Canada. 3Department of Biochemistry, The University of Toronto, 1 King’s College Circle, Medical 
Sciences Building, Toronto, Ontario M5S 1A8, Canada. 
*These authors contributed equally to this work. 

Resolution does not appear to be homogeneous within each 
three-dimensional map, suggesting further conformational heterogeneity 
within each class. However, additional classification strategies 
involving focusing classification on the regions thought to be variable"! did 
not produce meaningful classes. The resolution appears better in the 
soluble V1 region of the complex (Fig. 1a top and Extended Data Fig. 
3c) than in the membrane-bound Vo region (Fig. 1a bottom and 
Extended Data Fig. 3c). In the Vo region resolution is best where the 
a-subunit interacts with the c-ring (pink arrow in Fig. 1a bottom), 
suggesting a rigid interaction within an otherwise dynamic complex. 
Crystal structures and homology models for V-ATPase subunits were 
docked into the three different maps and their conformations refined 
by molecular dynamics flexible fitting’? (Fig. 1b and Extended Data 
Fig. 4). The most striking structural difference between the three maps 
is the position of the central rotor, consisting of subunits D, F, d, and 
the c-ring (blue star in Fig. 1b). Each map therefore appears to 
correspond to a rotational state of the enzyme. Rotational state 3 is the most 
different from the other two and is the class with the fewest particle 
images. The unequal distribution of particle images in the three classes 
suggests that, in the absence of free nucleotide, the V-ATPase relaxes to 
three unequally populated states. 

Figure 2 | The membrane-bound Vo region. a, The c-ring segment from the 
cryo-EM map accommodates ten c-, c′-, or c″-subunits (magenta). b, The 
c-ring sits adjacent to the a-subunit (green). c, The a-subunit contains two long 
and highly tilted α-helices (red arrows). The conserved Glu residues of the 
c-subunits lie between the two tilted α-helices of the a-subunit. In the 
eukaryotic V-ATPase c-ring, every other outer α-helix has a Glu residue (blue 
circles). Grey lines indicate the approximate membrane boundaries. d, The 
tilted α-helices of the a-subunit (green) contact three different α-helices of the 
c-ring (magenta). Scale bar, 25 Å. 

The proton-carrying c-ring of the yeast V-ATPase is a 
hetero-oligomer of subunits c, c′, and c″, each possessing a single conserved 
glutamate residue (Glu137, Glu145, Glu108, respectively) that can bind 
and transport protons during catalysis’. Subunits c and c′ have four 
transmembrane α-helices each, with an additional amino (N)-terminal 
α-helix for c″ that may not be membrane bound and is not necessary 
for function“*. The cryo-EM maps show the c-ring to consist of an inner 
ring and an outer ring of transmembrane α-helices (Fig. 1a bottom and 
Fig. 2a, b magenta). The ring appears to have 20-fold symmetry (Fig. 1a 
bottom and Fig. 2b), suggesting that the extra N-terminal α-helix of the 
c″-subunit may protrude into the ring, as seen in a recent crystal 
structure of a bacterial heteromeric c-ring’*. The structure of the c-ring 
in each cryo-EM map accommodates a total of ten c-, c′-, or 
c″-subunits (Fig. 2a, b) with each subunit contributing two α-helices to the 
outer ring and two α-helices to the inner ring. The modelled c-subunits 
fit the density with their N and carboxy (C) termini facing the luminal 
side of the membrane’®. The presence of ten subunits in the c-ring is 
inconsistent with an earlier prediction of a 4:1:1 stoichiometry for 
subunits c, c′, and c″, which would require a total of 6 or 12 subunits 
in the ring’”"*. A result of the existence of ten subunits in the c-ring is 
that in a tightly coupled enzyme complete rotation of the c-ring driven 
by hydrolysis of three ATP molecules would deliver ten protons across 
the lipid bilayer. Consequently, the c-ring stoichiometry probably sets 
the ATP:H+ ratio at 3:10 for the S. cerevisiae V-ATPase, which is the 
same ratio found with the F-type ATP synthase in S. cerevisiae 
mitochondria’. The ATP:H+ ratio was not known previously for any 
eukaryotic V-ATPase. With a Gibbs free energy for ATP hydrolysis of 
57 kJ mol⁻¹ at 30 °C, this ATP:H+ ratio limits the maximum pH gradient 
or voltage established across the vacuolar membrane in S. cerevisiae to 3.0 
units or 180 mV, respectively”. 
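
The limits on the vacuolar gradient implied by a 3:10 ATP:H+ ratio can be reproduced with a short calculation. This is a sketch of the standard thermodynamic relations, not the authors' own computation: the free energy of three ATP hydrolysis events is shared among ten protons, and that energy per proton is converted into an equivalent pH difference (dividing by RT ln 10) or membrane voltage (dividing by the Faraday constant). Function and parameter names are illustrative.

```python
import math

R = 8.314462618   # gas constant, J mol^-1 K^-1
F = 96485.33212   # Faraday constant, C mol^-1


def max_gradient(dg_atp_kj=57.0, atp_per_turn=3, protons_per_turn=10, temp_c=30.0):
    """Return (max pH difference, max voltage in mV) for a tightly coupled pump."""
    # Free energy available per mole of protons, in joules.
    energy_per_proton = dg_atp_kj * 1000.0 * atp_per_turn / protons_per_turn
    temperature_k = temp_c + 273.15
    delta_ph = energy_per_proton / (math.log(10) * R * temperature_k)
    delta_psi_mv = 1000.0 * energy_per_proton / F
    return delta_ph, delta_psi_mv


if __name__ == "__main__":
    delta_ph, delta_psi_mv = max_gradient()
    print(f"max pH gradient: {delta_ph:.2f} units")
    print(f"max voltage: {delta_psi_mv:.0f} mV")
```

With the stated 57 kJ mol⁻¹ at 30 °C this gives just under 3 pH units and just under 180 mV, in line with the rounded values quoted in the text.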

The three maps provide the highest-resolution insight available into 
the structure of the membrane-bound portion of subunit a, which 
contains the channels that conduct protons to and from the c-ring. 
The transmembrane proteins in the Vo region of the S. cerevisiae 
V-ATPase are subunits a, e, and the c-ring. However, the dodecylmalto- 
side-solubilized V-ATPase does not contain the e-subunit”’. 
Consequently, any density in the Vo region that cannot be attributed 
to the c-ring or detergent must be part of the a-subunit (Fig. 2b-d). 
Subunit a has been predicted to have eight transmembrane α-helices”. 
The density corresponding to the a-subunit has a complex fold and 
appears to have at least eight transmembrane α-helices and several 
well-defined structural elements above and below the expected position of 
the lipid bilayer. Strikingly, there are two highly tilted α-helices from the 
a-subunit that span the lipid bilayer where the a-subunit is in contact 
with the c-ring (Fig. 2c, red arrows). These α-helices contact a group of 
three α-helices from the c-ring (Fig. 2c, d). In F-type ATP synthases and 
V/A-ATPases each outer α-helix of the c-ring has a conserved 
proton-carrying Glu or Asp residue. In the eukaryotic V-ATPase every other 
α-helix on the surface of the c-ring has the conserved Glu residue. 
At the present resolution it is not possible to distinguish the outer 
helices with the conserved Glu residues from the outer helices lacking 
the Glu residues. Therefore, it is possible to fit the c-ring in the electron 
microscopy map in two different ways: one where a single Glu residue 
contacts the a-subunit and one where two different Glu residues contact 
the a-subunit. This latter arrangement would place two different 
c-subunits (Fig. 2c, circled in blue) in different chemical environments, 
enabling one c-subunit to exchange protons with the cytoplasm while 
the other exchanges protons with the organelle matrix. This 
arrangement could create two half-channels for proton translocation”* and was 
seen with the Thermus thermophilus enzyme**. The α-helices in the 
a-subunit are much easier to identify in the present maps at 6.9-8.3 Å 
resolution than in the map of the T. thermophilus V/A-ATPase at 9.7 Å 
resolution determined earlier”*. The α-helices identified here in all three 
states (Extended Data Fig. 5a) are consistent with the density from the 
earlier map of the bacterial enzyme (Extended Data Fig. 5b left) but are 
not entirely consistent with the previously proposed locations of 
α-helices in that density (Extended Data Fig. 5b right). 

Figure 3 | Symmetry mismatch between the V1 and Vo regions. Upper, the 
central rotor D-subunit (dark blue) makes ~120° steps (dashed blue line to blue 
line) from state 3 to 1 (a), 1 to 2 (b), and 2 to 3 (c) relative to the rest of the V1 
region, which itself undergoes a slight twist relative to the Vo region (black 
lines). Lower, in the Vo region, the d-subunit and c-ring undergo rotations of 
~120°, ~139°, and ~101° from state 3 to 1 (a), 1 to 2 (b), and 2 to 3 
(c), respectively (dashed cyan line to cyan line). PS, peripheral stalk. Scale bar, 
25 Å. 

Figure 4 | Flexibility of subunits. a, Comparison of AB pair 2 from the 
rotational states (state 2 in colour, states 1 and 3 in grey) shows that the pair 
progresses through ‘open’, ‘tight’, and ‘loose’ conformations. b-e, Overlay of 
structures from the rotational states illustrates that during rotation the 
d-subunit wobbles relative to the DF subcomplex (b), the peripheral stalks bend 
(c), the N-terminal domain of subunit a pivots towards the central rotor 
(d), and the ‘head’ domain of the C-subunit twists relative to the ‘neck’ 
(e). These movements are most apparent in Supplementary Video 3. Scale bar, 
25 Å. 

With three catalytic nucleotide-binding sites, the V1 region is 
expected to operate as a three-step motor while the ten titratable Glu 
residues of the c-ring suggest that the Vo region functions as a ten-step 
motor. This 3:10 ratio produces a symmetry mismatch between the V1 
and Vo regions. The new maps demonstrate how symmetry mismatch 
can be tolerated in rotary ATPases. The rotational position of subunit 
D of the central rotor was measured relative to the A3B3 hexamer 
(Fig. 3 upper, blue density and line), which itself twists within the rest 
of the enzyme between the three different states (Fig. 3, black lines). 
The rotation of subunit D corresponds to steps of 117° from state 3 to 
1, 119° from state 1 to 2, and 124° from state 2 to 3, all of which are 
in good agreement with the 120° steps expected for the threefold 
pseudo-symmetric V1 region. In the Vo region, rotation of one 
c-subunit against the a-subunit requires a 36° rotation of the c-ring 
(360°/10). The rotational position of the d-subunit and c-ring relative 
to the a-subunit could be measured precisely in rotational states 1 and 
2 owing to the resolution in the membrane region of these maps. In 
rotational state 3, the rotation of the c-ring against the a-subunit could 
not be measured with the same confidence. For the transition from 
state 1 to 2, we measured a 139° rotation of the d-subunit and c-ring 
relative to the transmembrane portion of the a-subunit (Fig. 3 lower, 
cyan density and line). This rotation from state 1 to 2 matches the 144° 
rotation (36° × 4) expected from a four-tenths rotation of the c-ring 
and would be expected to deliver four protons across the lipid bilayer. 
The transitions from states 2 to 3 and 3 to 1 were measured at ~101° 
and ~120°, respectively. Together, these rotations match ~216° of 
rotation (36° × 6), and are probably due to 108° (36° × 3) of rotation 
for each transition. Consequently, it appears that transitions from 
states 1 to 2, 2 to 3, and 3 to 1 transport four, three, and three protons, 
respectively. The observed rotational states reveal the conformations 
of the enzyme in vitro after the available ATP in solution has been 
hydrolysed. Therefore, it is possible that the conformations of the 
enzyme when rapidly hydrolysing ATP could be different from the 
conformations observed here. For example, when rapidly hydrolysing 
ATP, the stepping motions of the enzyme may be smoothed with 3.3 
protons transported for each ATP hydrolysis event. It is also possible 
that some ‘slip’ occurs during proton translocation and on average less 
than one proton is transported for each c-subunit. 
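
The assignment of protons to each transition can be checked by expressing the measured c-ring rotations in units of the 36° step per c-subunit. The sketch below uses only the angles quoted in the text; the dictionary and function names are illustrative. Note that the text assigns the 2 to 3 and 3 to 1 transitions jointly (about 216° for six subunits), whereas this sketch rounds each angle independently, which happens to give the same answer.

```python
C_SUBUNITS = 10
STEP_DEG = 360.0 / C_SUBUNITS   # rotation per c-subunit, 36 degrees

# Measured rotations of the d-subunit and c-ring relative to subunit a (degrees).
MEASURED_VO = {"1->2": 139.0, "2->3": 101.0, "3->1": 120.0}
# Measured rotations of subunit D relative to the A3B3 hexamer (degrees).
MEASURED_V1 = {"3->1": 117.0, "1->2": 119.0, "2->3": 124.0}


def nearest_steps(angle_deg):
    """Number of whole c-subunit steps closest to a measured rotation."""
    return round(angle_deg / STEP_DEG)


protons_per_transition = {k: nearest_steps(v) for k, v in MEASURED_VO.items()}

if __name__ == "__main__":
    print(protons_per_transition)
    print("protons per full turn:", sum(protons_per_transition.values()))
    print("average protons per ATP:", C_SUBUNITS / 3)
```

Both sets of measured angles sum to 360°, and the rounded steps sum to ten protons per turn, giving the average of about 3.3 protons per ATP mentioned above.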

Overlaying the different conformations of various subunits in the 
complex suggests some of the structural changes that may occur during 
rotary catalysis (Fig. 4). These conformational changes are illustrated 
dramatically by interpolating between the three rotational state 
structures (Supplementary Video 2). The different conformations of 
the catalytic subunits of rotary ATPases have been reported from 
mitochondrial and bacterial F1-ATPases?*® and bacterial V1/A1-
ATPases*”’ but never before for a eukaryotic V-type enzyme and never 
before within an intact rotary ATPase. In contrast to crystal structures 
of isolated V1/A1-ATPase or F1-ATPase subcomplexes, structures of the 
intact enzyme in different rotational states enable 
comparison of ‘open’, ‘tight’, and ‘loose’ conformations of the AB pairs 
when the rotor is in different positions. The observed conformations of 
the A3B3 hexamer (Fig. 4a and Extended Data Fig. 6a-c) reveal a bend of 
the C-terminal domain of the A-subunit that closely resembles the 
conformational changes seen in the Enterococcus hirae V1/A1-ATPase® and 
mammalian mitochondrial F1-ATPase” rather than the nearly rigid 
movement of subunits in the T. thermophilus V1/A1-ATPase”. The 
equivalent ‘open’, ‘tight’, and ‘loose’ conformations from different AB 
pairs can be overlaid with near perfect fidelity (Extended Data Fig. 6d-f). 
The protein samples used to prepare cryo-EM grids were not 
supplemented with nucleotide and consequently the nucleotide occupancy of 
the different catalytic sites is unknown. Because the nucleotide 
occupancy of the different AB pairs is not known, it is possible that the 
observed states correspond to the intrinsic asymmetry seen in AB pairs 
and αβ pairs of the E. hirae and S. cerevisiae V1/A1- and F1-ATPase 
crystal structures lacking nucleotide, which closely resemble the 
conformations of the enzymes with bound nucleotide*”’. 

Rotary ATPases have been proposed to have an elastic coupling
between their catalytic and membrane-bound regions to smooth the
transmission of power between ATP hydrolysis or synthesis and proton
translocation. This need for elastic coupling is exacerbated by
the 3:10 symmetry mismatch of the V₁ and Vo regions: the enzyme
must deform to allow the rotor to be simultaneously in the correct
position in the catalytic V₁ region and the membrane-bound Vo
region. Earlier studies have suggested that the central rotor of the
E. coli ATP synthase is the compliant element in that enzyme, while
the peripheral stalk remains rigid. The current structures indicate
that the extended helical part of the central rotor D-subunit, equivalent
to the F-type ATP synthase γ-subunit, remains rigid during rotation
while the part of subunit D in contact with subunit d bends (Fig. 4b,
dark blue, and Supplementary Videos 2 and 3). Further, the orienta- 
tion of the d-subunit changes with respect to the D- and F-subunits, 
wobbling to accommodate distortion of the enzyme during rotation 
(Fig. 4b, cyan, and Supplementary Videos 2 and 3). The catalytic A- 
and B-subunits push against the E- and G-subunits of the peripheral 
stalks during rotation. The E- and G-subunits of the peripheral stalks 
undergo a bending motion along their elongated coiled-coil region, 
reminiscent of the action of a cantilever spring (Fig. 4c, Extended Data 
Fig. 6g-i and Supplementary Videos 2 and 3). The N-terminal domain of 
the a-subunit swings parallel to the membrane, moving towards and away
from the rotation axis of the rotor like the arm of a record player. The 
bend in the a-subunit occurs at the narrow interface between its N- 
and C-terminal domains (Fig. 4d and Supplementary Videos 2 and 3). 
In comparison, the ‘head’ domain of the C-subunit twists like a torsion 
spring at the ‘neck’ domain and thereby maintains its connections 
between peripheral stalks 2 and 3 (Fig. 4e and Supplementary 
Videos 2 and 3). The existence of these conformational changes is 
not obvious from inspection of crystal structures of the individual 
subunits. However, when visualized as a movie, each subunit appears 
to have evolved to perform these motions. Overall, the structures 
presented here show that in the V-ATPase both the rotor and stator 
parts of the engine undergo coordinated conformational changes. 
This combination of flexibility and rigidity is probably a key attribute 
for the function of these highly efficient macromolecular machines. 
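
The scale of the 3:10 mismatch discussed above can be illustrated with simple arithmetic. The sketch below is not part of the original analysis: it assumes an idealized rotor that pauses at three evenly spaced catalytic positions and a ring with ten evenly spaced positions, and the function names are hypothetical. It reports how far each catalytic position lies from the nearest ring position, which is the angular offset that some part of the enzyme would have to absorb.

```python
# Illustrative arithmetic only (idealized, evenly spaced steps are an assumption).
CATALYTIC_SITES = 3   # AB pairs in the V1 region, from the 3:10 ratio in the text
RING_POSITIONS = 10   # positions in the membrane ring, from the 3:10 ratio


def step_angles(n_sites=CATALYTIC_SITES, n_ring=RING_POSITIONS):
    """Return (catalytic step, ring step) in degrees."""
    return 360.0 / n_sites, 360.0 / n_ring


def mismatch_at_catalytic_dwells(n_sites=CATALYTIC_SITES, n_ring=RING_POSITIONS):
    """For each catalytic position, return the signed offset in degrees
    to the nearest ring position."""
    catalytic_step, ring_step = step_angles(n_sites, n_ring)
    offsets = []
    for k in range(n_sites):
        angle = k * catalytic_step
        nearest_ring_angle = round(angle / ring_step) * ring_step
        offsets.append(angle - nearest_ring_angle)
    return offsets


if __name__ == "__main__":
    print(step_angles())                   # catalytic and ring step sizes
    print(mismatch_at_catalytic_dwells())  # offset at each catalytic position
```

Under these assumptions the catalytic step is 120° and the ring step is 36°, so two of the three catalytic positions fall 12° away from the nearest ring position. The real dwell positions of the enzyme are not given in the text, so the numbers indicate only the order of magnitude of the deformation involved.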


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


No statistical methods were used to predetermine sample size. 

S. cerevisiae V-ATPase samples were purified as described previously from 10 l
fermenter growths of S. cerevisiae strain JTY002 (ref. 9). The enzyme was purified
by affinity to 3×Flag tags at the C termini of the A-subunits, providing samples
(30 µl at ~10 mg ml⁻¹) in buffer containing 50 mM Tris-HCl (pH 7.4), 150 mM
NaCl, 0.02% (w/v) DDM, and 150 µg ml⁻¹ 3×Flag peptide, without any added
nucleotide. Holey carbon-film-coated electron microscopy grids with regular
arrays of 500-800 nm holes were prepared by nanofabrication and cryo-EM
specimens were prepared as described previously. A total of 3,685 30-frame
movies were acquired with defocuses between 1 and 7 µm with an FEI Tecnai
F20 microscope operating at 200 kV and equipped with a Gatan K2 Summit direct
detector device camera. The camera was used in counting movie mode with five
electrons per pixel per second for 15 s and 0.5 s per frame. This exposure rate
resulted in one electron per square ångström per frame on the specimen. Frames
were aligned using the new program alignframes_lmbfgs and averaged.
Averaged frames were used for determination of contrast transfer function
parameters with CTFFIND3 (ref. 32) and selection of coordinates for 200-250
particle-like features per image, some of which corresponded to true particle images, by
template matching with TMaCS. These coordinates were then used to extract
candidate particle images from the unaligned movie frames with individual
particle motion correction using the alignparts_lmbfgs algorithm. A measured ~2%
anisotropy in the magnification of the microscope was corrected in particle images
by interpolation in Fourier space, to produce a calibrated pixel size of 1.45 Å.
Contrast transfer function parameters were corrected to account for the effects of
this anisotropy. A total of 790,642 candidate particle images were subjected to
two- and three-dimensional classification with Relion. Particle images in
two-dimensional classes with averages that resembled projections of the V-ATPase and
contained high-resolution features were selected for further analysis.
Three-dimensional classes were obtained from 217,533 particle images and three classes
containing 106,445 particle images were used to build the three final
three-dimensional maps. Local resolution was estimated with ResMap. Calculations with
Relion were performed using the SciNet cluster and the SickKids High
Performance Facility. Three-dimensional maps were segmented with UCSF
Chimera. Homology models were calculated either with Phyre2 (ref. 41) or
with HHpred and MODELLER, and atomic models from the PDB and
homology models were docked into electron microscopy maps by MDFF. Crystal
structures were available for subunits C and H, and the DF and EG
subcomplexes. Homology models were built for subunits A, B, d, the N-terminal
domain of subunit a, and the c-subunit. The magnitudes of subunit rotations
between states were measured in UCSF Chimera and movies were generated with
UCSF Chimera. 
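
The exposure figures quoted above can be cross-checked with a short calculation. This is a minimal sketch rather than part of the published workflow: it assumes that the calibrated pixel size of 1.45 Å is the appropriate one for converting the camera exposure rate into a specimen exposure, and the constant and function names are hypothetical.

```python
# Back-of-envelope check of the exposure values quoted in the Methods.
FRAMES_PER_MOVIE = 30              # from the text
SECONDS_PER_FRAME = 0.5            # from the text
RATE_E_PER_PIXEL_PER_S = 5.0       # from the text
PIXEL_SIZE_ANGSTROM = 1.45         # calibrated pixel size (assumed to apply here)


def exposure_summary():
    """Return (total exposure time in s,
               electrons per square angstrom per frame,
               total electrons per square angstrom per movie)."""
    total_time_s = FRAMES_PER_MOVIE * SECONDS_PER_FRAME
    pixel_area = PIXEL_SIZE_ANGSTROM ** 2
    per_frame = RATE_E_PER_PIXEL_PER_S * SECONDS_PER_FRAME / pixel_area
    total = per_frame * FRAMES_PER_MOVIE
    return total_time_s, per_frame, total


if __name__ == "__main__":
    total_time_s, per_frame, total = exposure_summary()
    print(f"exposure time: {total_time_s:.1f} s")
    print(f"per frame:     {per_frame:.2f} e/A^2")
    print(f"per movie:     {total:.1f} e/A^2")
```

With these inputs, 30 frames of 0.5 s give the stated 15 s exposure, and the per-frame exposure comes out at roughly 1.2 electrons per square ångström, consistent with the rounded value of one electron per square ångström per frame.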

Code availability. All new computer programs described above are available from 
https://sites.google.com/site/rubinsteingroup/home. 
Extended Data Figure 1 | V-ATPase subunits and rotation. a, The
V-ATPase from S. cerevisiae consists of subunits A₃B₃CDE₃FG₃Hac_xc′_yc″_zde,
where x, y, and z denote unknown stoichiometries. Subunits with upper-case
letter names correspond to components of the soluble V₁ region while
lower-case names denote components of the membrane-bound Vo region. The
e-subunit is not found in the detergent-solubilized S. cerevisiae V-ATPase.
During rotary catalysis, ATP hydrolysis drives rotation of the rotor, consisting
of subunits DFc_xc′_yc″_zd (outlined in black), which rotates relative to the rest of
the enzyme. Upper inset, the three different nucleotide-binding sites of the V₁
region can be found in three different conformations: ‘tight’ (where ATP is
expected to be bound), ‘loose’ (where ADP is expected to be bound), and ‘open’
(where no nucleotide is bound). Lower inset, rotation of the c_xc′_yc″_z-ring against
the a-subunit leads to proton translocation from the cytoplasmic side of the
membrane to the luminal side of the membrane. Proton translocation occurs via
two half-channels through the membrane. b, V-ATPase activity is regulated by
reversible dissociation where the V₁ region separates from the Vo region. The
H-subunit inhibits ATP hydrolysis in the dissociated V₁ region. Proton
translocation in the dissociated Vo region is blocked by an unknown mechanism. 

Extended Data Figure 2 | Data collection. a, A representative micrograph;
examples of V-ATPase particle images are shown circled in red. These particle
images were selected from the 200 candidate particle images identified
automatically from template matching. b, Tracking of particle and other image
feature trajectories with the alignparts_lmbfgs algorithm. Trajectories are
exaggerated by a factor of 5 to allow visualization. 

Extended Data Figure 3 | Three-dimensional maps from rotational states.
a, Surface rendered views of the three three-dimensional maps are shown. Scale
bars, 25 Å. b, Fourier shell correlation (FSC) curves after a ‘gold standard’
refinement of the three maps are shown. The resolutions measured from these
curves at a Fourier shell correlation of 0.143 are the same as the resolutions
measured after correcting for masking effects by high-resolution
noise-substitution calculations. c, Local resolution estimation shows that features in
the V₁ region are better resolved than in the Vo region. 


©2015 Macmillan Publishers Limited. All rights reserved 




Extended Data Figure 4 | Map segmentation and molecular dynamics
flexible fitting. Different subunits are shown fitted into their
corresponding map densities in rotational states 1, 2, and 3, including
AB pair 3 (a), the N-terminal domain of subunit a (b), the central rotor
DFd subcomplex (c), subunit C (d), and peripheral stalk 1 (e). Scale bar,
25 Å. 

Extended Data Figure 5 | C-terminal domain of the a-subunit. a, The
membrane-bound C-terminal domain of the a-subunit appears similar in all
three rotational states. b, The density from the C-terminal domain of the
Thermus thermophilus subunit I, equivalent of the a-subunit, at 9.7 Å
resolution is consistent with the structure of the a-subunit from S. cerevisiae
(left). However, the transmembrane α-helical densities identified previously in
that map (right) are not entirely consistent with the current maps. 

Extended Data Figure 6 | Flexibility in V-ATPase subunits. a-c, Each AB
pair in the A₃B₃ hexamer goes through ‘open’, ‘loose’, and ‘tight’ conformations
as the enzyme passes between the three rotational states. d-f, Overlay of all
three open, all three loose, and all three tight structures shows that the
conformations are nearly the same for each AB pair. g-i, Each of the three EG
peripheral stalk structures undergoes similar bending motions between the
three rotational states. Scale bar, 25 Å. 

CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature14470 


Erratum: Entanglement with 
negative Wigner function of almost 
3,000 atoms heralded by one 
photon 


Robert McConnell, Hao Zhang, Jiazhong Hu, 
Senka Cuk & Vladan Vuletić 


Nature 519, 439-442 (2015); doi:10.1038/nature14293 


In Fig. 2a of this Letter, the solid blue line in the linear plot was 
inadvertently removed during the production process; this figure 
has now been corrected in the online versions of the manuscript. 
ADJUNCT TEACHING 


For love of the lecture 


Contract teaching positions are becoming the norm for many aspiring professors. 
Know how to make the best out of them. 


BY KENDALL POWELL 


Last year, after finishing work each day
at her administrative job with an art
company, Susan Finley drove to Santa 
Monica College, California, to do what she loves 
most — teaching. On contract as an adjunct 
professor, she ran courses in psychology and 
counselling. She had 120 students last semester, 
but no office, which made holding office hours a 
challenge. Instead, she would meet students over 
dinner in the cafeteria or in a park on campus. 


Contingent faculty members, also known as 
adjunct staff, can be full- or part-time, are off 
the tenure track and are generally paid on a 
per-course or hourly basis. As contract posi- 
tions, the jobs bring greater challenges and less 
access to university resources than permanent 
positions. Yet despite the less-than-ideal work- 
ing conditions, contingent faculty members 
such as Finley cherish the role for the tangible 
connection that it provides to a university and 
an academic department. 

Before and after her classes, Finley advises 
and tutors students, a commitment for which 
she receives no compensation. “I’m one of the 
first professors these students have, and I love 
it,” she says. “But I’m not paid for those hours, 
even though those are the hours I remember 
the most. Financially, it’s not easy.” 

She is taking a gamble that the part-time 
work will help her to land a full-time teaching 
position — at least eventually. Thanks to other 
faculty members who have observed and 
reviewed her work, and students who have 
evaluated her performance positively, she 


14 MAY 2015 | VOL 521 | NATURE | 247 


has made valuable inroads on that goal. But 
there are no openings for a full-time faculty 
member in the department — and that is a 
burgeoning trend in the United States and 
other nations. 

Although the number of part-time and 
contract teaching positions is growing quickly 
at universities across North America and 
elsewhere, full-time positions — both on and 
off the tenure track — have become more and 
more difficult to find. In 1975, tenured and 
tenure-track faculty members made up 45% 
of the faculty at US colleges and universities, 
and part-time instructors just 25%. By 2011, 
the numbers had reversed, with just 23% on 
the tenure track or already tenured, and part- 
time staff soaring to 42%. Today, more than 
two-thirds of instructional staff are non-tenure 
track, and most of them are on temporary con- 
tracts (see go.nature.com/gws6ow). 


WHET THE APPETITE 
At 32 and with a newly minted doctorate, 
Finley thinks that teaching on a contract basis 
is a good way to test the waters of various 
career paths, but does not see it as a viable 
long-term prospect. Others in similar roles 
agree that this is the appropriate attitude to 
take in the current job market. 

Non-tenure-track teaching positions, 
often called ‘instructor’ or ‘lecturer’, can offer 
freedom and flexibility, but often provide 
much-lower pay — salaries frequently start on 
a par with graduate-student stipends. For those 
who want to keep a hand in science, but not 
entirely at the bench, these roles offer a chance 
to hone teaching skills and work directly with 
undergraduate science students. They can 
also come with a hefty dose of advising and 
counselling responsibilities. Some instructors 
maintain limited research programmes; for 
others, research is out of reach. 

Early-career scientists who are considering 
this route should understand that it is 
not generally realistic or sustainable as a 
long-term option or for those who want to 
engage in substantial research. Still, some instructors 
do manage to turn part-time posts into full- 
time positions, and a rare few jump to the 
tenure track at their institutions. Others use 
these posts as a bridge back to the academic 
environment following stints in other sectors, 
or as a way to supplement other work and 
fulfil their love of the classroom. Scientists 
who intend to land a full-time post in 
academic instruction will need to hone their 
time-management strategies and capitalize on 
their connections. 

A passion for teaching is the most important 
attribute for success in the face of low pay, 
instructors say. Also desirable is a fondness 


Ethan Tsai delivers an organic-chemistry lecture at the Metropolitan State University of Denver. 


for and an understanding of the university 
student population — knowing how to spark 
and maintain that group’s attention. Beyond 
that, would-be instructors must be willing and 
able to invest large chunks of time for which 
they are not always compensated. 

“I put a lot of thought into trying to keep students 
interested: what is this age group into?” 
says Pamela Buzas-Stephens, an adjunct geolo- 
gist who juggles teaching positions at both the 
University of Colorado Boulder and Front 
Range Community College in Longmont. But 
determining how to keep students engaged is 
just the start, she says. Instructors need hours 
each week to prepare for a course, even if they 
have plenty of experience in front of a class. 
She estimates that an instructor who teaches a 
course for the first time will spend 3-6 hours 
preparing for every 1-hour session. In her 
experience, a 3-hour-per-week introductory 
course entails a minimum of 25 hours’ worth 
of classroom activities, grading and advising. 

There are ways to save time. For large 
introductory courses, first-time instructors 
could borrow notes, slides and syllabuses from 
colleagues who have taught it in the past. And 
they should prepare lectures far enough in 
advance to give themselves a buffer zone if they 
get busy with research or other obligations. 

Buzas-Stephens relies heavily on allocating 
strict time slots for specific activities to limit 
the hours that she spends on course prepara- 
tion. A PowerPoint presentation might get two 
hours, she says. Then it is time to move on. 

Another huge time sink for many instructors 
is responding to student e-mails. Ken Diebel, 
an instructor in the fisheries and wildlife 
department at Oregon State University (OSU) 
in Corvallis, generates a ‘frequently asked 
questions’ document for each course to cut 
down on e-mails. Jennifer Stempien, a geology 
instructor and student adviser at the University 
of Colorado Boulder, has e-mail templates that 
she uses for common student dilemmas. She 
also preloads as much material as possible 
into the university's online course systems, 
then sets dates to roll out information as the 
semester progresses. 

One of the trickiest aspects of 
Stempien’s job — and a highly time- 
consuming one — has little to do with geology. 
Every year, she finds herself advising a handful 
of her 700 students on their daunting personal 
challenges, including physical and mental- 
health issues, and even suicide attempts and 
assaults. “I have all the appropriate campus 
offices on speed-dial,” she says. 

Technological advances such as the 
emergence of online instruction are putting 
extra strain on many instructors’ time. Ecologist 
Luke Painter runs a mammal-systematics class 
online at OSU and had to adjust to recording 
video lectures. “I feel like I’m teaching in the 
dark, because I know the students are out there, 
but I can't talk to them directly,” he says. 

Diebel, who teaches an online course on 
riparian ecology, received a small grant last 
year for improvements such as videotaping 
lecture segments in the field. He attends an 
annual online-teaching symposium at OSU to 
learn what innovations other instructors use. 


SIDELINE ACTIVITIES 

With such heavy time constraints, it can be 
difficult to add research to adjunct jobs. But that 
has not stopped many instructors from trying. 
Stempien happened on an opportunity when 
she returned to the department in which she 
had done her postdoc to finish up a collabora- 
tion. While there, she ran into her department 
chair, who was looking for someone to teach 
a few introductory class sections. The chair 
hired her on a per-semester basis initially, and 
eventually offered her a three-year part-time 
instructor contract. The move brought with 
it better benefits, compensation and access to 
departmental and university resources. It also 
gave her a place in the academic geology world 
that she loves. 

She has struggled to carve out time to write 
up her research on student learning in the 
geosciences. Her work requires only database 
access, and she feels fortunate to be in a 
department that encourages her to continue 
her research. But it will have to wait until her 
summer break, when classes end. 

Other instructors find ways to carry out at 
least some research during the academic year. 
Keen to continue her palaeontological studies 
on microfossils, Buzas-Stephens acquired a 
few necessary instruments and began to put in 
extra hours at her desk, examining specimens 
of tiny, shelled organisms called foraminifera. 
And Painter says that at OSU, instructors are 
allowed to apply for grant funding as investi- 
gators. He plans to follow this route to resume 
his research on the effects of bison and elk 
grazing on trees. US universities are increas- 
ingly offering this option to instructors to bring 
in more grants. 

Others are happy to leave research out of 
the academic equation. Diebel, who worked 
for Oregon state agencies as a riparian 
specialist for 20 years, combines instructing 
with stream-restoration work as a private 
consultant. The consultancy allows him to 
teach part-time while maintaining his income, 
giving him more freedom and less bureau- 
cracy. “The cool thing is, I don’t have the 
‘publish or perish’ or the tenure-promotion 
stuff to worry about. I focus on what I want to 
do, which is teaching,” he says. 

Many instructors select these part-time jobs 
for the flexibility, and may engage in consulting 
on the side, enjoy semi-retirement or spend 
time with family members. Some, however, 
view adjunct teaching as a stepping stone to 
a more-permanent post, for better or for 
worse. Teisha Rowland, a postdoc in cardiology 
at the University of Colorado Denver, is 
teaching general biology at the Metropolitan 
State University of Denver (Metro State) part-time 
to gain crucial teaching experience for her 
tenure-track job search. 

“It’s invigorating to be around that student 
energy the first time they learn about science,” 
she says. But she is well aware that her adjunct 
position is best treated as temporary and as a 
way to build a CV. Buzas-Stephens agrees that 
science instructors who try to cobble together 
multiple low-paying adjunct contracts into a 
career are setting themselves up for heartache. 

Andrew Robinson at Carleton University 
in Ottawa knows that heartache firsthand. 
He excels at the front of a lecture hall, but has 
struggled to convert his position teaching 
physics into a full-time, salaried post. “The 
chances of getting an academic job are already 
small; if you go into adjunct teaching, your 
chances probably get smaller.” 

Robinson and his wife both teach part-time 
to give them the flexibility to care for their son, 
who has special needs. But Robinson is bitter 
that, at age 52 and with 25 years of academic 
and industry experience, he works long hours 
teaching courses year-round to scrape together 
Can$35,000 (US$29,000) a year. 

“There is no respect for contract instructors 
at all,” says Robinson. “We are viewed as 
temporary replacements, which can be thrown 
away.” His sentiments echo the frustrations 
of adjunct faculty members across North 
America, who have launched a movement to 
improve their working conditions and pay (see 
‘Groups give a voice to part-time faculty’). 


Adjunct Pamela Buzas-Stephens runs a field trip. 


ADVOCACY FOR ADJUNCT ROLES 


Groups give a voice to part-time faculty 


Adjunct faculty members have connected 
across institutions, taking a stand against 
an education system built on jobs that are 
often precarious and demoralizing. Maria 
Maisto, president and executive director 
of the New Faculty Majority in Akron, Ohio, 
an advocacy group for US contingent and 
adjunct staff, says that the group formed in 
2009 to provide guidance and support to 
the growing population of adjunct teachers. 
It aims to alert those inside and outside 
academia to the problems associated with 
such positions, by collecting statistics and 
advocating for practical solutions. 

The Delphi Project, headquartered 
at the University of Southern California 
in Los Angeles, and the nationwide US 
Coalition on the Academic Workforce help 
adjunct faculty to organize and lobby for 
better working conditions. These groups 
support unionization efforts and work on 
legislative solutions at the state and federal 
levels. In 2013, Maisto testified before 
the US Congress for its 2014 report, The 
Just-In-Time Professor, which discusses the 
difficulties facing those in contingent roles 
(see go.nature.com/qtp9bs). 

The Adjunct Project, a crowd-sourced 
database hosted by the Chronicle of Higher 
Education (adjunct.chronicle.com) in 
Washington DC, provides information on pay 
rates, benefits and working conditions across 
the United States. For example, the database 
shows that salaries for teaching biology in 
the Denver, Colorado, area range, at present, 
from US$950 to $8,000 per course. This kind 
of knowledge can help those in contingent 
roles to advocate for themselves. K.P. 


Still, scientists can take inspiration from the 
rare few adjunct instructors who do manage 
to make lemonade from lemons. Through 
hard work and perhaps a bit of luck, Ethan 
Tsai, a chemist at Metro State, achieved the 
near-impossible: he converted an instructor 
position to a tenure-track faculty spot in the 
same department. 

Tsai began as a contract instructor for an 
organic-chemistry laboratory course and, over 
the next few years, was asked to take on more 
and more duties — including redesigning the 
lab-course curriculum and co-authoring a 
grant for the department to acquire a higher- 
resolution nuclear magnetic resonance (NMR) 
machine to analyse components of chemical 
samples. After the instrument arrived, he 
became its default manager. 

Tsai ticked all the boxes for landing a ten- 
ure-track post: he made himself indispen- 
sable to the department by bringing in both 
grant money and research opportunities. He 
also showed his future colleagues that he was 
an enthusiastic collaborator who fit in the 
department's culture — a key aspect for tenure- 
track hires. Those connections made him 
the strongest candidate for his department's 
faculty search later on. 

He describes his years on contract, when he 
was putting in 60-hour weeks, as doing heavy 
labour for graduate-student pay. Yet, he says, 
it was worth it for the reward of a tenure-track 
position — and his love of teaching made it 
possible. “If I didn’t enjoy teaching, I would not 
have lasted beyond a year,” he says. 


Kendall Powell is a freelance writer in 
Lafayette, Colorado. 
WHEN LAST I SAW THE STARS 


Lost vision. 


BY JEFF HECHT 


“I have news that no one else knows,” 
I told my grandmother. “I promised 
I would tell you first.” 

“Yes, Little Helen. You have been good 
about that,” she said. My parents called her 
Big Helen, because she had been so much 
taller when I was little. I grew just as tall as 
she was, but now she had 
shrunk and was too frail 
to stand for very long. 

“Remember the new 
space telescope out 
past the Moon that is so 
big and so powerful it 
can see planets around 
other stars? I used it to 
see something new.” Its 
30-metre mirror sees far 
into the infrared, where 
planets like Earth glow 
with their own heat. I 
had told her before, but at 
104 years old, she doesn’t 
remember details well. 

“You told me about 
that before, Little Helen. 
When last I saw the stars, 
you were just a little girl.” 

I was not so little then. 
I had been 12, visiting 
her and my grandfather 
at their summer cabin in 
rural Maine. We stayed 
up late, turned out all the lights, and went 
out to the open field behind the house. 
Leaning back in old aluminium lounge 
chairs, we saw the whole Universe spread 
across a deep black sky. My grandfather 
pointed out the Milky Way, handed me his 
big binoculars, and told me to look closely. 
When I focused the binoculars, the cloudy 
areas turned into more and more stars, and 
I was enchanted. 

“They were beautiful,” I told her. “I still 
remember that. It made me want to be an 
astronomer.” Remembering how long ago 
that had been, I was grateful for the medical 
miracles that had let her live two decades 
longer than her grandparents had. 

“Do you remember what I told you then, 
Little Helen?” 

“About the Milky 


again. The first global geoengineering 
project was spreading haze across the sky to 
cool the ground. You didn't notice, but we 
could already see the scattered light in the 
night sky, as if city lights had come to our 
country cabin.” 

“We can see the stars better from space 
telescopes than we ever could from the 
ground,” I told her. “There’s no air in the way.” 


“I want to look up and see stars all across 
the sky, Little Helen,” she said. “I haven't 
seen them in so many years; the haze and 
the lights are everywhere.” 

“We need the geoengineering aerosols, 
grandmother,” I said. “They are the only 
way we can control the climate, to keep it 
from getting too warm, and melting the rest 
of Antarctica and Greenland.” 

“We could stop burning coal,” she said. 
“I remember when governments talked 
about stopping greenhouse-gas emissions 
and not lighting up the roads and highways 
and the whole outdoors all night long. We 
called it conservation.” 

She and my grandfather had told me 
how they had worked to control pollution. 
They scared me when they told me what 
the future would bring. “Outdoor lighting 
does good things, grandmother,” I said. “It 
lets us see when we go out at night. It pre- 
vents crime. It sustains our modern 24-hour 
lifestyle.” 
“There’s too much light and too much 
pollution, Little Helen,” she said. “But I don’t 
want to argue. You said there was something 
you wanted to tell me. I want to hear your 
news.” 

I smiled. That was my grandmother, 
trying to avoid family arguments. “What I 
wanted to tell you was that the new telescope 
works so well we can see the disks of planets 
the size of Earth around 
other stars. We found 
one with oxygen in the 
air, and with other gases 
that are signs of life and 
civilization. We found 
the same sulfates that 
we spray into the strato- 
sphere to keep our Earth 
cool. We think the planet 
may have a civilization.” 
It was only 30 light years 
away, and I wondered 
why we had heard no 
signals from them. But I 
didn’t want to complain 
to my grandmother that 
no one cared enough 
about other intelligent 
life to approve my plan to 
signal to them. 

“That is wonderful to 
hear, Little Helen. I never 
thought that we would be 
able to see planets so far 
away. But what I want to 
see is stars sprinkled across a dark sky, and 
the whole glorious sweep of the Universe.” 

“I have pictures,” I said. “I can show you.” 

“I don’t want little pictures. You cannot 
know the Universe unless you see the stars 
spread across the whole sky,” my grand- 
mother said. “When I was young, I looked 
up at a sky full of stars and told your grand- 
father that someday our grandchildren’s 
grandchildren would go visit the stars they 
saw in the sky. But if your grandchildren’s 
grandchildren can't see the Universe of stars 
spread across the sky, they will never try to 
explore it.” 

I shivered, remembering how small my 
postcard of the Grand Canyon seemed after 
I had stood in the bottom of the canyon. I 
turned away so my grandmother would not 
see me cry. 


Jeff Hecht is Boston correspondent for New 
Scientist and a contributing editor to Laser 
Focus World. 
Colorectal cancer (CRC) kills almost 700,000 people 
every year, making it the world’s fourth most deadly 
cancer (after lung, liver and stomach cancer). It is a 
disease of modernity: the highest rates of incidence are in 
developed countries (see page S2). As the world becomes 
richer, and more people shift to Western diets and lifestyles, 
the incidence of CRC is likely to increase. 

But there is encouraging news. Screening for the disease 
has become routine in many parts of the world, and newer, 
less invasive technologies are being developed to replace the 
highly invasive colonoscopy (S4). And evidence is mounting 
that the disease could be largely prevented by a combination 
of drugs, nutritional supplements, diet and exercise (S6). 

There is encouraging research on several fronts. The 
development of three-dimensional ‘organoids’ derived from 
adult stem cells could help match drugs to a patient’s specific 
tumour type (S15). And DNA fragments that have escaped 
from tumours into the blood could be detected in ‘liquid 
biopsies’ that can characterize the tumour and help monitor 
the effectiveness of treatments (S9). 

On the therapeutic front, however, progress is a matter of 
inches. New drugs have roughly doubled the average survival 
time for advanced CRC over the past decade, but the patient 
usually dies within three years (S12). 

As with many other diseases, scientists are finding that the 
organisms that live inside us play a big role (S10). But whether 
an abnormal microbiome is a cause of CRC or an effect is still 
shrouded in mystery. This is just one of several fundamental 
questions about the disease that remain to be resolved (S16). 

We are pleased to acknowledge that this Outlook was 
produced with the support of F. Hoffmann-La Roche AG and
Eli Lilly & Company. As always, Nature retains sole 
responsibility for all editorial content. 
COLORECTAL CANCER
A DISEASE OF GROWTH 


Colorectal cancer occurs throughout the world but is most common in developed countries. As heavily 
populated countries such as China undergo rapid economic development, the incidence of the disease looks 
set to increase. An animated version of this infographic is at go.nature.com/wgiqvp. By David Holmes.


DISEASE AND DEVELOPMENT 


More than half (55%) of the cases of colorectal cancer occur in developed regions, but 
developing countries are catching up. As the economies of countries such as Brazil, 
China and India grow, so does their incidence of colorectal cancer’. 
The highest regional estimated incidence
rates are in Australia and New Zealand,
where the disease is almost tenfold more
common than in West Africa; 9 of the 20
lowest-ranked countries in the UN Human
Development Index are in West Africa.

Four of the five South American countries that score
highest on the UN Human Development Index (Argentina,
Brazil, Chile and Uruguay) are also in the top five South
American countries for colorectal cancer incidence.

Map legend (incidence categories): >26.3; 14.4-26.3; 8.4-14.3; 4.9-8.3; <4.9; no data.




Global trends in colorectal cancer closely 
follow economic fortunes’. Incidence in 
Looking further east, Japan experienced a
rapid rise in cases between 1990 and 1995,
followed by almost ten years of steady or
falling incidence. This period, termed ‘the lost
decade’ by economists, saw falling wages and
economic stagnation. By contrast, the
breakneck development of Japan’s neighbour
China was accompanied by a dramatic rise in
colorectal cancer incidence.
AMERICAN CANCER SOCIETY


Ascending colon 


Descending 
colon 


A PREVENTABLE EPIDEMIC 


DIET 


Many foods have been linked with 
increased (animal fat, sugars) or 
decreased (garlic, fibre, calcium) risk 
of colorectal cancer, but the most 
compelling evidence for increased 
risk relates to the consumption of red 
and processed meats*. 


GENETICS 


Having a first-degree relative with 
colorectal cancer increases the risk of 
the disease by 80%. By the age of 40, 
almost all adults with familial 
adenomatous polyposis’ will have 
colorectal cancer, and BRCA1 
mutations may also increase risk. 


OBESITY 


Closely linked to diet and physical 
inactivity, obesity also increases the 
risk of colorectal cancer. One 2013 
study! found that obese people have 
a 33% higher risk of colorectal cancer 
than people of healthy weight. 
One of the most crucial factors for
colorectal cancer is age. Up to 90% of 
all colorectal cancers occur in people 
aged 50 years and over®. 


EXERCISE 


The World Cancer Research Fund and the 
American Institute for Cancer Research 
both identify exercise as protective against 
colorectal cancer. Colorectal cancer risk is 
17-24% lower in the most physically 
active people compared with the least 
physically active ®. 


SCREENING 


Screening finds precancerous polyps that 
can be removed before they become 
cancerous. The problem is ensuring access. 
In the United States, for example, one-third 
of adults over 50 have not been screened 
appropriately. 


The decline in colorectal cancer 
mortality in the United States has 
not been shared equally". In the 
1980s, African Americans died at 
a 13% higher rate than white 
Americans from the disease. Two 
decades later, they died at a 53% 
higher rate. This disparity is 
caused by factors including 
differences in how likely they are 
to receive the latest treatments, 
and the prevalence of other 
health problems. 


Blood vessel 


Lymph node 


HOW IT SPREADS 


Colorectal cancer affects the colon and the rectum, parts of 
the digestive system known as the large intestine. It usually 
begins as a non-cancerous growth called a polyp on the 
inner lining of the intestine. 


WHAT ARE THE STAGES OF 
COLORECTAL CANCER? 


(1) Cancerous cells form on the inner lining of
the large intestine.

(2) Cancer cells grow into the wall of the colon
or lymph vessels.

(3) Cancer cells penetrate the blood or the
lymph vessels.

(4) Cancer cells spread into nearby lymph
nodes, and can also be carried in the
blood vessels to other organs.

8. Robsahm, T. E. et al. Eur. J. Cancer Prevent. 22, 492-505 (2013).
9. Boyle, T. et al. J. Natl Cancer Inst. 104, 1548-1561 (2012).
10. Ma, Y. et al. PLoS One 8, e53916 (2013).
11. Siegel, R., DeSantis, C. & Jemal, A. CA Cancer J. Clin. 64, 104-117 (2014).
12. Robbins, A. S., Siegel, R. L. & Jemal, A. J. Clin. Oncol. 30, 401-405 (2012).
Colonoscopy is often the best method for identifying colon cancer at an early stage, but it is invasive.


Early alert 


Scientists are developing an array of choices for screening 
colorectal cancer, but patients often choose to go without. 


BY CASSANDRA WILLYARD 


Every day, a UPS truck delivers dozens of
identical white boxes to the back door
of a nondescript building in Madison,
Wisconsin. Inside each box is a miniature pail
filled with human excrement. This faecal matter
is bound for Exact Sciences’ 3,000-square-metre
laboratory, a sparklingly clean facility
that analyses each sample for the genetic markers
of colorectal cancer.

The test, called Cologuard, was approved by
the US Food and Drug Administration (FDA)
in mid-2014. It comes at a time when public-health
officials in the United States are desperately
trying to get more people screened.

Only about two-thirds of those eligible for
colorectal-cancer screening in the United States
have been tested. In March 2014, a coalition of
organizations dedicated to reducing colorectal-cancer
deaths announced a target of 80% by
2018. Exact Sciences provides its at-home test
as an alternative for people who are not willing
to have a colonoscopy, an invasive procedure
that requires sedation and a clean bowel.

Although colonoscopy is by far the most
common screening option in the United
States, researchers have yet to demonstrate its 
effectiveness in a randomized controlled trial. 
Other countries have embraced cheaper, less- 
invasive tests that have proved to be effective. 
“Our goal is to be able to have the most 
people screened with an effective test,” says
Douglas Corley, a gastroenterologist at the US 
health-care provider Kaiser Permanente in 
Oakland, California. This might mean com- 
bining existing tests to create more effective 
screening programmes, or looking for new 
strategies that are not only effective but also 
appealing enough for people to get tested. 


KING COLONOSCOPY 
In the 1980s, colonoscopies were rare. Instead, 
the American Cancer Society recommended 
either sigmoidoscopy, a less-invasive proce- 
dure that uses a shorter scope to view only 
the bottom portion of the colon, or the faecal 
occult blood test (FOBT), which identifies 
blood in the stool — a potential sign of cancer. 
The FOBT uses a chemical called guaiac to 
detect the haem component of the oxygen- 
carrying protein haemoglobin. Early FOBTs 
were poor at detecting malignancy, but more
recent high-sensitivity versions detect between 
50% and 79% of colorectal cancers that are 
found with colonoscopy. A variant of FOBT 
called a faecal immunochemical test (FIT), 
which uses an antibody to detect globin in the 
stool, can pick up between 55% and 100% of 
cancers detected with colonoscopy. But stool 
tests do not involve peering inside the colon and 
so are less effective at detecting precancerous 
polyps — a major drawback because detecting 
and removing these growths can prevent colo- 
rectal cancer from occurring in the first place. 

Oncologist Alfred Neugut of Columbia 
University Medical Center in New York was 
one of the first to suggest, back in 1988, that 
colonoscopy might offer better cancer detec- 
tion. Neugut argued that the long colonoscopy 
tube would allow physicians to screen a larger 
portion of the colon than was possible with sig- 
moidoscopy, so it would detect more cancers. 

In 2000, US TV news personality Katie 
Couric, whose husband had died of colorectal 
cancer, encouraged her audience to have colon- 
oscopies by undergoing the procedure on her 
show. In the same year, The New England Jour- 
nal of Medicine published two studies'* show- 
ing that colonoscopy can detect cancers missed 
by sigmoidoscopy. An editorial noted that 
“relying on flexible sigmoidoscopy is as clini- 
cally logical as performing mammography of 
one breast”. The following year, federal health- 
insurance programme Medicare and many pri- 
vate insurers began paying for the procedure as 
a screening tool for colorectal cancer. 

The US Preventive Services Task Force, an 
independent panel that issues evidence-based 
screening recommendations, lists three accept- 
able methods: FOBT alone, FOBT in combi- 
nation with sigmoidoscopy, and colonoscopy. 
However, colonoscopy is the most common. 
According to 2012 data from the US Centers 
for Disease Control and Prevention, 65% of 
adults aged 50 to 75 reported being up to date 
with screening. The number screened by a 
colonoscopy within the past 10 years exceeded 
by more than 6-fold the number screened by 
FOBT in the past year, and by more than 60-fold 
the number screened with the FOBT-sigmoi- 
doscopy combination. 

If people could have only one screening test 
once in their lifetime, colonoscopy would be the 
clear winner, says Corley. But colorectal-cancer 
screening is not supposed to be a one-off event. 
The American Cancer Society recommends 
that people between the ages of 50 and 75 at 
average risk of the disease have either a colon- 
oscopy every 10 years, a sigmoidoscopy every 
5 years, or a stool test every year. 

Colonoscopy brings a risk of bleeding and 

bowel perforation, 
The range of screening methods for colon cancer includes (left to right) stool blood test kits, faecal immunochemical tests and the Cologuard stool DNA test.


Three large, randomized trials have begun 
to study the value of colonoscopies, but they 
are unlikely to yield results for many years. 
By contrast, several trials have shown con- 
sistent results for FOBT and sigmoidoscopy. 
Sigmoidoscopy has been found to lower colo- 
rectal-cancer mortality by 22-31%, and FOBT 
reduces it by 15-33%. 

Crucially, colonoscopies only work if people 
have them. In 2008, the US Preventive Services 
Task Force modelled the effectiveness of vari- 
ous testing strategies. With adherence at 80%, 
colonoscopy screening offered the greatest gain 
in life-years, followed closely by high-sensi- 
tivity FOBT and FIT. But when colonoscopy 
adherence was only 50%, it was no more effec- 
tive than FIT, high-sensitivity FOBT, or either 
of those combined with sigmoidoscopy. 

Other countries are keener to try the alter- 
natives. In Australia, the National Bowel Can- 
cer Screening Program sends FOBTs to people 
to use at home as they turn 50, 55, 60, 65, 70 
and 74. Since the programme began in 2006, 
more than 4.6 million Australians have been 
offered the test, and 1.8 million have returned 
it for analysis. The goal is for all Australians 
between the ages of 50 and 74 to receive the 
test every two years by 2020. 

The National Health Service (NHS) Bowel 
Cancer Screening Programme in England also 
favours FOBT. “It’s recognized that colonos- 
copy is the gold standard,” says Sally Benton, 
associate director of the NHS bowel-cancer 
screening hub in Guildford, UK. But the coun- 
try lacks the resources to implement it. 

In 2010, a team of UK researchers demon- 
strated the benefit of one-off sigmoidoscopy for 
colorectal-cancer screening’, and the country is 
now running a pilot scheme for the procedure. 
Participants receive an invitation for sigmoi- 
doscopy at the age of 55, and will receive FOBT 
kits every two years from the ages of 60 to 69. 


DETECTING DNA 

Cologuard provides another screening option. 
The stool test combines FIT with analysis to 
detect specific genetic markers — mutations 
in KRAS, a gene involved in cell division that 
is often mutated in colorectal cancer, and 


chemical modifications of two other genes 
associated with the disease. These markers 
provide a signature of the presence or absence 
of cancer, says David Ahlquist, a gastroenter- 
ologist at the Mayo Clinic in Rochester, Min- 
nesota, who helped to develop Cologuard. 
“While many cancers and polyps don’t bleed,
they all shed cells.” 

Cologuard is not intended to replace colon- 
oscopy — anyone who receives a positive result 
using any stool test still needs a colonoscopy to 
confirm it. But it does seem to have the edge 
over FIT. In a randomized trial* published in
2014, nearly 10,000 participants received either 
Cologuard or FIT in addition to a colonoscopy. 
Cologuard detected 92% of colorectal cancers 
compared with 74% for FIT, and found more 
precancerous lesions. But it also delivered more 
false positives: 13% of those who tested positive 
using Cologuard had no sign of cancer or pre- 
cancerous lesions when they received a colon- 
oscopy. And the study did not examine how the 
test would compare to FIT in the long term. 

Douglas Rex, a gastroenterologist at Indiana 
University School of Medicine in Indianapolis, 
points out that about 70% of Cologuard’s per- 
formance is due to its FIT component. What’s 
more, stool DNA tests are recommended once 
every three years, whereas FIT is offered annu- 
ally. If FIT were performed annually for three 
years, as it should be, “you probably would 
make up some of that difference”, he says.
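
Rex's argument about repeated testing can be illustrated with a back-of-the-envelope calculation. The sketch below is not from the trial: it assumes, purely for illustration, that each annual FIT round has the 74% single-round sensitivity quoted above and that rounds are statistically independent. Repeat tests on the same person are correlated in practice, so the result is an optimistic estimate. The function name `cumulative_sensitivity` is hypothetical.

```python
def cumulative_sensitivity(per_round: float, rounds: int) -> float:
    """Probability that at least one of `rounds` tests flags an existing cancer.

    Illustrative assumption: every round is an independent trial with the
    same per-round sensitivity. Repeat tests on one person are correlated
    in practice, so this is an optimistic estimate, not a trial result.
    """
    if not 0.0 <= per_round <= 1.0:
        raise ValueError("per_round must be between 0 and 1")
    if rounds < 0:
        raise ValueError("rounds must be non-negative")
    return 1.0 - (1.0 - per_round) ** rounds


# Single-round sensitivities for colorectal cancer quoted from the 2014 trial.
FIT_SENSITIVITY = 0.74
COLOGUARD_SENSITIVITY = 0.92

# Three annual FIT rounds versus one Cologuard test in the same three years.
fit_three_rounds = cumulative_sensitivity(FIT_SENSITIVITY, 3)

print(f"FIT, three annual rounds: {fit_three_rounds:.1%}")
print(f"Cologuard, one test:      {COLOGUARD_SENSITIVITY:.1%}")
```

Under the independence assumption, three rounds of FIT would catch roughly 98% of cancers, more than a single Cologuard test. The real figure is likely lower, which fits Rex's cautious wording that annual FIT would "probably" make up "some" of the difference.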

Then there is the cost. In 2014, Medicare 
agreed to pay US$500 for a Cologuard test 
— much higher than its $5 reimbursement 
for FOBT and $22 for FIT. Even if a person 
took the FIT test annually, as recommended, 
it would still cost much less than Cologuard 
over three years. “It’s a little bit tricky to know 
whether the extra cost is worth it,” Rex says. 
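
The cost comparison is simple arithmetic, sketched here using only the Medicare reimbursement figures and testing intervals quoted in this article. It assumes one Cologuard test every three years and one FIT or FOBT every year, and it ignores follow-up colonoscopies and all other costs. The dictionary and function names are hypothetical.

```python
# Medicare reimbursement per test in US dollars, as quoted in the article.
REIMBURSEMENT_USD = {"Cologuard": 500, "FIT": 22, "FOBT": 5}

# Tests taken in a three-year window under the recommended intervals:
# stool DNA once every three years, FIT and FOBT once a year.
TESTS_PER_THREE_YEARS = {"Cologuard": 1, "FIT": 3, "FOBT": 3}


def three_year_cost(test: str) -> int:
    """Reimbursed test cost over three years (follow-up care not included)."""
    return REIMBURSEMENT_USD[test] * TESTS_PER_THREE_YEARS[test]


costs = {test: three_year_cost(test) for test in REIMBURSEMENT_USD}
for test, cost in costs.items():
    print(f"{test}: ${cost} over three years")

# How many times more Cologuard costs than annual FIT over the same period.
cologuard_to_fit_ratio = costs["Cologuard"] / costs["FIT"]
print(f"Cologuard costs {cologuard_to_fit_ratio:.1f} times as much as annual FIT")
```

On these figures, three years of annual FIT comes to $66 against $500 for one Cologuard test, a gap of roughly seven and a half times.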


BLOOD TESTS 

To overcome people's queasiness about provid- 
ing stool samples, several companies are devel- 
oping tests that require nothing more than a 
drop of blood. “Many people don’t want to 
touch the stool,” says Ann Zauber, a biostatistician
at the Memorial Sloan Kettering Cancer
Center in New York. 


© 2015 Macmillan Publishers Limited. All rights reserved 


German company Epigenomics markets 
one such blood test in Europe, called Epi pro- 
Colon, which looks for a particular chemical 
tag on a gene called SEPT9. A study involving
8,000 people published in summer 2014 found 
that Epi proColon detected 68% of colorectal- 
cancer cases’, but it had a high false-positive 
rate of nearly 20%. And, Rex points out, the test 
has a lower sensitivity for early-stage cancers 
than for late-stage cancers, which have a lower 
survival rate. For these reasons, he says, “it’s not 
a good test for colon-cancer screening”. A 2013
cost-effectiveness modelling study suggested 
that screening with Epi proColon every two 
years would be less effective and more costly 
than the alternatives. 

But Epigenomics is targeting people who are 
not now being screened. “We want to lower the 
barrier for these patients to enter the screening
programme,” says Thomas Taapken, chief
executive of the Berlin-based company. “Our 
assumption is that a blood test would do that.” 

In June 2014, the FDA declined to approve 
the test and asked the company to produce evi- 
dence that Epi proColon will increase compli- 
ance. Six months later, Epigenomics launched 
a study in the United States to answer that ques- 
tion. Researchers will invite people who have 
been offered screening but failed to comply to 
come into the clinic, where they will be selected 
at random to receive either a take-home FIT 
test or an Epi proColon blood test. 

In a perfect world, adults would be screened
for colorectal cancer when they are supposed 
to be, with a screening method that is proven to 
be effective. But in the real world, compliance 
is rarely perfect. Ultimately, says Zauber, “the 
best test is the one that gets done”.


Cassandra Willyard is a freelance science 
writer based in Madison, Wisconsin. 
PREVENTION


Tending the gut 


Drugs, lifestyle changes and other measures might lower the risk of colorectal cancer — 
but the evidence is a long time coming.


BY LAUREN GRAVITZ 


In a clinic at the University of Pittsburgh in
Pennsylvania, gastroenterologist Robert
Schoen has been injecting high-risk
patients with a vaccine against colorectal cancer.
With a series of three shots, he hopes to
prime their immune systems so that they recognize,
tag and destroy the disease in its earliest
stages, before it can take hold and spread.

His trials (the first to examine whether
immunotherapy might be useful for preventing
colorectal cancer as opposed to treating it)
are part of a swell of research dedicated to cancer
prevention that has taken hold in the past
two decades.

“Prevention is the most important thing we
can do from a public-health standpoint, but
it’s also the most difficult area for getting randomized
proof,” says Jason Zell, who studies the
prevention and treatment of colorectal cancer
at the University of California, Irvine. State-of-the-art,
double-blind, placebo-controlled trials
are always difficult when it comes to disease
prevention. And in the case of cancer, the aim
is to thwart something that may not otherwise
emerge for decades. 

Colorectal cancer is largely preventable. Over 
the past 30 years, clinical trials and observa- 
tional studies have suggested that everything 
from drugs to nutritional supplements, diet 
and exercise could help to stave off this lethal 
disease. The difficulties, however, lie in identi- 
fying what works, proving that it does, match- 
ing those people most at risk of cancer with 
the most appropriate interventions, and then 
persuading them to take the necessary action. 
And the whole process must be done in a way 
that is overwhelmingly safe. 

Initially, many scientists believed that the 
best hope for preventing colorectal cancer lay 
in common elements of our diet. But even sup- 
plements that originate in everyday foods may 
not be as benign as they initially appear. Folate 
(from which folic acid is derived) may provide 
protection in the earliest stages of disease, but 
it actually seems to encourage the growth of 
tumour cells once they arise. And a large-scale 
trial aimed at preventing lung cancer showed 
that not only was β-carotene ineffective for
prevention, it seemed to increase both risk of
disease and risk of stroke. 

Moreover, few scientists have the funds or 
the institutional structure to power the large 
and lengthy studies required to demonstrate 
effective prevention. “It takes a while for the 
progression of events to happen, and it’s dif- 
ficult to get a trial organized, completed and 
funded that can measure meaningful end- 
points,” says gastroenterologist Paul Limburg, 
director of preventive services at the Mayo 
Clinic in Rochester, Minnesota. Very few 
such studies have been designed for colorectal 
cancer, he says. To test prevention in a more 
realistic time span, most use the best proxy 
they can: decreased incidence of precancer- 
ous polyps called adenomas. 

For now, the best way to prevent colorectal
cancer is a colonoscopy (see ‘Early alert’,
page S4). The cancer almost always
starts as benign polyps that grow
in the lining of the large intestine. The
disease is so slow-growing that regular colonoscopies
can prevent between 76% and 90% of
malignant cases from developing into something
more serious.

The problem is that not enough people are 
tested. In the United States, only about 65% 
of those for whom regular colonoscopies are 
recommended actually have them. For the 
rest, and even for those who are regularly 
screened, the addition of preventive therapy 
— an approach often referred to as chemo- 
prevention — could save lives. “It makes sense 
to have something in addition to screening,” 
says Limburg. “The ideal chemopreven- 
tive agent could delay screening or increase 
screening intervals.” 

Researchers are trying to figure out the most 
effective approach, but it is complicated — 
there may be many molecular pathways that 
trigger the cancer, and even more potential 
molecular targets. “The number of mecha- 
nisms we have is almost equal to the number 
of scientists investigating the problem,” says 
Ernest Hawk, an oncologist and cancer-pre- 
vention researcher at the University of Texas 
MD Anderson Cancer Center in Houston. 


INVESTIGATING INFLAMMATION 

Despite all this complexity, colorectal-cancer 
researchers have been making progress. For 
colorectal cancer, “more than for any other 
cancer, there is evidence that there are agents 
we can use that would be effective”, says 
Andrew Chan, who studies colorectal-cancer
prevention at Harvard Medical School in Bos- 
ton, Massachusetts. 


Early research suggested that the best way 
to stave off colorectal cancer was to inhibit 
enzymes that foster inflammation, such 
as prostaglandin and the cyclooxygenases 
COX-1 and COX-2, which encourage abnor- 
mal cell growth. The anti-inflammatory agent 
with the best long-term data so far is the hum- 
ble aspirin. 

Numerous studies over the past 20 years 
have shown that taking a daily dose of aspirin 
for a decade or more can reduce the risk of 
colorectal cancer by more than 50%, prob- 
ably by inhibiting COX-1 and suppress- 
ing prostaglandin. But the effectiveness of 
aspirin may be limited by genetics. A study” 
published in March 2015 found that aspirin 
and other non-steroidal anti-inflammatory 
drugs (NSAIDs) failed to protect a small per- 
centage of people who had specific genetic 
variations on chromosome 15. In addition, 
taking aspirin in the long term can cause 
gastrointestinal bleeding, so oncologists rec- 
ommend taking it only if a person’s familial 
risk or history of polyps overwhelms the risk 
of bleeds. 

Such side effects have led some groups 
to try to make aspirin safer for chronic use. 
At the University of Texas Health Science 
Center in Houston, Lenard Lichtenberger 
and his colleagues have found that lipids from 
membranes such as those lining the stomach 
and intestines guard the organs from injury. 
Aspirin eats away at that lining, so Lichten- 
berger has combined aspirin with lecithin — a 
natural phospholipid that helps to protect the 
mucous membrane without interfering with 
drug absorption. 

The drug, developed for cardiovascular 
disease prevention by Houston-based PLx 
Pharma, was approved by the US Food and 
Drug Administration (FDA) in 2012 for the 
same uses as over-the-counter aspirin. The 
University of Texas team is conducting ani- 
mal trials to evaluate whether it is as effective 
as aspirin at preventing colorectal cancer. The 
researchers hope to begin clinical trials in the 
next few years. 

Beyond aspirin, the only therapeutics that 
have proven effective at lowering the risk of 
colorectal cancer are other NSAIDs, such as 
ibuprofen and COX-2 inhibitors. But whereas 
the evidence for aspirin is undisputed, these 
NSAIDs are on shakier ground. Their benefit
is a bit lower — the reduction in disease risk 
is closer to 20-30% — and, like aspirin, most 
increase the risk of gastrointestinal bleeds 
and other side effects. Celecoxib, one of the 
best-studied of the group, decreases the risk 
of bleeding but increases the risk of cardio- 
vascular complications such as heart attack 
and stroke. 

One particularly promising NSAID is sulin- 
dac, an FDA-approved anti-inflammatory 
drug that not only inhibits COX enzymes but 
also helps to remove molecules called poly- 
amines from the epithelial tissue that lines 
the colon and other organs. Elevated levels of
polyamines have been associated with carcino- 
genesis, and inhibiting them has prevented 
the development of colon and other cancers 
in mice. 

Zell and his colleagues are investigating the 
preventive power of sulindac combined with 
eflornithine, a drug that inhibits the creation 
of polyamines. In a three-year trial involving
375 people with a history of precancerous polyps,
the combination of sulindac and eflornithine
reduced the recurrence of adenomas by
70%, and progression to advanced adenomas by
92%, compared with placebo*. Now Zell is leading
a trial of more than 1,300 people to find
out whether the drugs can prevent the recurrence
of colorectal cancer. By studying the two
drugs in a four-arm trial (one placebo, one
sulindac-only, one eflornithine-only, and one
both sulindac and eflornithine), Zell and his
collaborators hope to find out whether the two
drugs work better together than they do on
their own. The trial is expected to end in 2019.


LIVING PROOF 

Lifestyle, too, has preventive potential. People 
who exercise regularly and who eat a diet 
low in red and processed meats have much 
lower rates of colorectal cancer. Yet although 
healthy habits are consistently associated with 
decreased disease, proving that these activities 
were the cause of lowered risk is difficult. 

Most lifestyle studies are observational in 
nature — they use data from interviews or 
questionnaires to gather information on such 
factors as daily medication, diet and exercise, 
and then assess whether there is a link between 
behaviour and outcome. Graham Colditz, a 
cancer-prevention researcher and epidemi- 
ologist at Washington University in St. Louis, 
Missouri, conducted one such study* on the US 
population. “When we looked at national data 
comparing obesity, inactivity, red-meat and 
processed-meat intake, alcohol and cigarette 
smoking, we saw that 80% of colon cancer was 
really looking preventable,” he says.

His study found that specific dietary factors 
seem to be implicated in a small fraction of 
colorectal cancers: about 6% can be attributed 
to eating red meat, and 5% to alcohol con- 
sumption. How they trigger carcinogenesis 
remains unclear. Some research suggests that 
the high iron content in red meat promotes 
carcinogenesis by increasing cell growth, and 
others indicate that one of alcohol’s metabolites 
can interact directly with DNA. One recent 
study drives the point home. In a population of
more than 77,000 people who refrained from 
smoking and drinking, those who ate fish but 
no other meat seemed to cut their risk of colo- 
rectal cancer nearly in half’. 

Exercise can improve glucose metabolism and reduce the risk of developing colorectal cancer.

Obesity and physical inactivity also
contribute significantly to colorectal-cancer
risk, and here researchers have more confi- 
dence in the mechanism: they blame glucose 
metabolism. High levels of insulin, which the 
body uses to absorb glucose from the blood, 
seem to stimulate the growth of cancer cells, 
and the colon seems particularly sensitive to 
increased insulin. But exercise can halt or even 
reverse that cycle. The more physically active 
someone is, the more sensitive the body’s cells 
become to the insulin, improving their ability 
to metabolize glucose. Moreover, contracting 
muscles use up more glucose. “We've looked 
at blood markers for insulin and seen that 
they relate to subsequent risk of colon cancer,” 
Colditz says, “and exercise has a direct impact 
on glucose metabolism.” 

Among other research, data from the Harvard
Nurses’ Health Study (a long-term project
that followed more than 200,000 nurses
to assess how various factors influence health)
found a strong connection between exercise
and reduced risk of colorectal cancer, for both 
primary and recurring cancer. 

The Canadian CHALLENGE trial, led by 
Kerry Courneya at the University of Alberta 
in Edmonton, is recruiting people without 
disease who have previously been treated for 
either stage II or stage III colorectal cancer to
find out whether they benefit from a physical- 
activity-based intervention. The researchers 
hope that this 6-8-year study will yield the most 
definitive answer yet about how exercise affects 
disease risk in colorectal-cancer survivors. 


BUILDING A BETTER SHIELD 

The link between insulin metabolism and 
cancer is prompting others to ask whether 
a medication that is already available could 
help to avert the disease. Michael Pollak, an 
oncologist at McGill University in Montreal, 
Canada, is testing metformin, a diabetes drug 
that suppresses excess glucose production.
Metformin is one of the most commonly pre- 
scribed drugs in the West and has a long and 
proven safety record. 

Its effects on colorectal-cancer prevention, 
however, are still unknown. What limited 
evidence there is comes from in vitro studies 
showing that, in the presence of metformin, 
epithelial cells from the colon proliferate more 
slowly — the opposite of what happens when 
cells start to turn cancerous. Metformin also 
has been shown to increase glucose uptake in 
the colon. 

Pollak has confirmed the first finding in
studies using human patients: taking metformin
for a month significantly decreased
the turnover of colon epithelial cells. But
studies to validate the preventive effect are
still a way off. “We certainly know that
metformin has an 
excellent safety record 
— we have a million person-years of fol- 
low-up with it,” Pollak says. “We would 
love to put together a prevention trial, but 
that’s a big undertaking and people want 
more evidence.” 

Two dietary supplements also raised hopes 
for prevention: calcium and vitamin D. But 
results have been mixed. Clinical epidemi- 
ologist John Baron at the University of North 
Carolina in Chapel Hill presented disap- 
pointing results at the American Association 
for Cancer Research conference in 2014. In 
a trial of more than 2,200 participants, he 
looked at the preventive potential of calcium 
and vitamin D, but found that neither of the 
supplements, nor a combination of the two, 
significantly reduced adenomas. “There are a
lot of observational data suggesting that vitamin
D would be dynamite,” Baron says. “The
results are a big disappointment.” The same, 
he says, was true of calcium. 

Such setbacks have left researchers feeling 
frustrated. “There is a lot of interest in diet and 
supplements for cancer prevention, particularly
for cancers of the gastrointestinal tract,”
says Chan. “To date, however, there really are 
not any home runs.” 

Among all these unproven approaches to 
preventing colorectal cancer, the most intrigu- 
ing may be the vaccine being developed at the 
University of Pittsburgh. Schoen believes that 
the immunotherapy approach that is gain- 
ing traction for cancer treatment — teaching 
a patient’s immune system to recognize and 
attack antigens specific to cancer cells — can 
also stop colorectal cancer from developing in 
the first place. A vaccine given during the earli- 
est stages of the disease, before adenomatous 
polyps have had the chance to progress into 
something more insidious, would prompt the 
immune system to develop an antibody against 
antigens expressed on precancerous cells. 

Schoen’s work focuses on a protein called 
MUC1, which is present on colorectal adenomas.
In an early trial involving 40 people who
had had advanced adenomas removed, the 
researchers administered three doses of the 
vaccine and measured the immune response. 
A year later, they gave the patients a booster. 
They found that nearly half of the subjects 
showed a significant immune response.
According to Schoen and his colleagues, this 
‘memory’ means that if a vaccinated person 
develops an adenoma that expresses MUC1, 
the immune system could potentially recog- 
nize the protein and attack the cells expressing 
it, preventing cancer from developing. 

The initial results were so promising that 
Schoen and his colleagues are now conduct- 
ing a multicentre trial to determine whether 
the vaccine can prevent the recurrence of 
adenomas. 

In the end, many researchers believe that the 
best option for preventing colorectal cancer 
might be a combination of several approaches. 
“Prevention may not be one-size-fits-all,” 
says Limburg. “In fact, it probably isn’t.” But
with the right mix — and unlike with other 
cancers, such as those of the pancreas or the 
blood — it may well be possible to prevent 
colorectal cancer.


Lauren Gravitz is a freelance science writer 
based in Hershey, Pennsylvania. 
Q&A Victor Velculescu
Out for blood 


Oncologist Victor Velculescu, co-director of cancer biology at the Johns Hopkins Sidney Kimmel 
Comprehensive Cancer Center in Baltimore, Maryland, describes how circulating tumour DNA 
can be used to improve the detection and treatment of colorectal cancer. 


What is circulating tumour DNA (ctDNA) and 
how might it help in cancer therapy? 

In ctDNA, fragments of DNA that escape 
from tumours into the bloodstream provide 
genetic clues that help us to detect and analyse 
the disease. These clues are especially valuable 
when tissue biopsies are not available, both at 
the start of treatment and when monitoring 
patients as therapy progresses. 


Is ctDNA research still mainly in the lab? 

We are moving to clinical applications. One 
application is looking at late-stage cancers. 
In many cases, tissue analysis is difficult or 
impossible, so a blood-based biopsy could 
be very helpful. We have shown in colorectal 
cancer that a liquid-biopsy approach allows us 
to identify patients with amplification of the 
HER2/neu receptor — a target of the drug tras- 
tuzumab (Herceptin). A small percentage of 
people with colon cancer have dramatic ampli- 
fication of HER2/neu, and there are ongoing 
clinical trials that will figure out if it is a good 
target in colon cancer, and whether it will pro- 
vide a therapeutic option for these patients. 


Are you also looking at monitoring patients’ 
responses to treatment? 

Many patients are in clinical trials for a long 
time, and it may not be obvious whether they 
are responding to the investigational agent. 
A liquid biopsy makes it possible to get a 
measure of tumour burden that very nicely 
tracks disease over time. Clinical trials are 
extraordinarily expensive, and often we have 
to wait until the end to see if the therapy has 
worked. Monitoring subjects over the course 
of the trial by using liquid biopsies could pro- 
vide insight into therapeutic responses much 
more quickly. 


Can these liquid biopsies help us to 
understand resistance to drug therapy, by 
picking up clues about new mutations? 

Yes, they allow us to study drug resistance, 
which arises from genetic alterations that 
occur in the tumour over time as it responds 
to the evolutionary pressure of the therapy. 
For example, we looked at a person with colon 
cancer who initially responded to inhibition 
of a protein called epidermal growth factor 
receptor (EGFR), but over time the tumour
became resistant. We did a whole-genome 
analysis and found that the tumour had an 
amplification of the MET gene, which provides 
an alternative route for tumours to grow even 
in the presence of the EGFR blockade. In that 
case, we identified not only a mechanism of 
resistance but also a new potential avenue of 
therapy, because a number of MET inhibitors 
are now available or in clinical trials. 


How can ctDNA aid the early detection of 
cancer? 

One example is the early detection of disease 
recurrence. If you knew soon after surgery 
by using a liquid-biopsy test that the patient 
still had residual cancer, you could give some 
type of targeted therapy to help clear out the 
remaining cancer. Ultimately, liquid biop- 
sies could make it practical to look at blood 
from apparently healthy individuals and try 
to determine whether they have a certain 
type of cancer. We should be cautious about 
this, though, because the levels of ctDNA in 
very-early-stage cancer might be below the 
limits of detection. But it will be an exciting 
avenue to investigate because a substantial 
fraction of patients with early, potentially 
curable disease are likely to be detectable in 
this way. 


We are also seeing progress in the analysis
of circulating tumour cells: cells shed
from the tumour into the bloodstream that
are still intact, as opposed to fragments of
DNA. Are the roles of these two liquid-biopsy
approaches becoming clear?

Yes, I think they are complementary. In some 
settings, ctDNA will be more sensitive for diag- 
nostic purposes. On the other hand, circulat- 
ing tumour cells are an extremely powerful 
source of tumour cells when you don’t have 
access to the tumour itself. The challenge is 
to grow them and propagate them effectively 
outside the body. They could be a great avenue 
for testing new therapies. 


Are drug companies starting to adopt ctDNA 
analyses? 

There is a lot of interest from pharmaceutical 
companies, and liquid biopsies are beginning 
to be used as entry criteria for clinical trials. 
Companies are also interested in using liquid 
biopsies to monitor responses to therapy. If 
the analyses they do in their clinical trials are 
successful, companies may want to convert 
some of those tests into companion diagnos- 
tics. When they do, it is important for these 
diagnostic approaches to have the same rig- 
our as the therapies. Like any approach, these 
liquid-biopsy genetic tests, and the regulatory 
framework guiding their use, won't be perfect. 
But we should not let the perfect become the 
enemy of the good.
A biofilm comprising three species of bacterium (green, cyan and magenta) on a colon tumour.


Microbial mystery 


Gut bacteria have an important but elusive role in the
formation of colorectal cancer.


BY SARAH DEWEERDT 


Some years ago, four women and two men
donated samples of their gut bacteria to
science. Three women had healthy guts,
but the other donors had colorectal cancer.
Researchers implanted the human gut microorganisms
into ‘germ-free’ mice, which were delivered
by Caesarean section and reared in sterile
cages, and so lacked gut microbes of their own.
This is a common technique for studying the
health effects of the microbiome, the roughly
100 trillion bacteria that inhabit the gut. But
the results were unexpected. When the scientists
exposed the mice to a chemical mutagen
that causes colorectal cancer, mice given
microbes from colorectal-cancer patients
developed fewer tumours than did mice that
had received bacteria from healthy human
donors. “It was a very weird result,” says study
leader Patrick Schloss, a microbial ecologist at
the University of Michigan in Ann Arbor.

The findings are emblematic of the puzzle
researchers face when trying to understand
the microbiome’s contribution to colorectal
cancer. “There’s growing evidence for a role of
bacteria in colon cancer, but it’s not definitive,” 
says Alberto Martin, an immunologist at the 
University of Toronto, Canada, who has stud- 
ied microbial interactions with a biochemi- 
cal pathway involved in an inherited form of 
colorectal cancer. 

It makes intuitive sense that the gut micro- 
biome has some role in colorectal cancer, 
whether for good or for ill. The idea is in line 
with evidence linking the microbiome to 
liver cancer and to other aspects of health and 
disease, such as obesity and the regulation of 
metabolism. Microbes are also known to be 
involved in the initiation and progression of 
inflammatory bowel disease, which is a risk 
factor for colorectal cancer. 

What is more, most of the bacteria in the 
human gut make their home in the colon, or 
large intestine. “You almost never see cancer 
in the small bowel,” says Jun Sun, a biochemist 
at Rush University in Chicago, Illinois. “Most 
bowel cancer originates in the colon.” In other 
words, the cancer occurs where the bacteria 
reside. 

There is more to the connection between
the microbiome and colorectal cancer than 
theory and logic. Over the past decade or so, 
experimental evidence from animal studies, 
comparisons of gut bacteria from people with 
and without colorectal cancer, and analyses of 
the biochemical workings of individual bacterial
strains all point to a link. But a full picture
of how the microbial community in the gut 
acts and reacts during the development and 
progression of colorectal cancer remains elusive. 

Many studies have documented differences 
between the microbiomes of people with colo- 
rectal cancer and people with healthy guts. 
The patterns vary widely from study to study, 
however, so scientists cannot say much about 
the microbiome abnormalities, or dysbiosis, in 
colorectal cancer. The most consistent finding 
is a high level of Fusobacterium, which is com- 
monly found in the mouth but rarely in the 
healthy gut. Several studies have also shown 
that people with colorectal cancer have higher 
than normal levels of Escherichia coli bacteria.

But colorectal cancer can take decades to 
develop, so such differences may reveal little 
about the origin of the disease, says Christian 
Jobin, a microbiologist at the University of 
Florida in Gainesville. “What was cooking in 
their gut before the cancer diagnosis?” he asks. 

Researchers have tracked changes in the 
microbiome throughout the course of colorec- 
tal-cancer development in mice, but there have 
been no such longitudinal studies in people. 
In 2014, however, Schloss and his colleagues 
reported differences between the microbiomes 
of three groups of people: those with healthy 
colons, those who had precancerous lesions 
called adenomas, and those with colorectal can- 
cer. Adding microbiome data to conventional 
risk factors, such as age, race and body mass 
index, improved the ability to predict which of 
the three clinical categories a person belongs to 
— hinting at the potential to use microbiome 
analysis in colorectal-cancer screening. 

But the cause-and-effect relationships still 
remain unclear. “What comes first — is it the 
cancer, the disease process, or the dysbiosis?” 
asks Shahid Umar, who studies gut bacteria at 
the University of Kansas in Kansas City. “That's 
the million-dollar question.” 


OF MICE AND BACTERIA 
Direct evidence that gut-dwelling microbes 
help to cause colorectal cancer comes mainly 
from studies on mice. For example, Schloss 
and his colleagues transferred the gut micro- 
biome from mice with colorectal cancer into 
germ-free mice, and then exposed these mice 
to a chemical mutagen. As long as this tech- 
nique is used to transfer gut microbiota within 
a species, the results are straightforward: mice 
that received their microbiome from donors 
with cancer developed more and bigger 
tumours than mice that received their micro- 
biome from healthy donor mice. 

The researchers also showed that mice 
exposed to the chemical mutagen do not
develop tumours if they receive a dose of anti- 
biotic that kills off some of the microbiome, 
suggesting that the microbiome plays a part in 
the onset of cancer. And they found that mice 
that receive a cancer-causing microbiome do 
not develop tumours if they are not exposed to 
the mutagen — so the mutagen is essential too. 
Other research groups have also found that 
cancer does not develop without the presence 
of certain bacteria, but that these bacteria do 
not, on their own, trigger tumours. 

To unravel the biochemical mecha- 
nisms underlying these patterns, 
scientists have simplified matters by 
investigating individual bacterial spe- 
cies. This research has focused mainly 
on three types of bacterium: Fusobac- 
terium; strains of E. coli that produce 
colibactin toxin; and toxin-producing 
strains of Bacteroides fragilis (a com- 
mon cause of diarrhoeal illness). All 
three of these bacteria increase tumour 
formation in mouse models that are 
susceptible to colorectal cancer. For 
example, cancer-prone mice that are 
raised in a sterile environment and then
colonized with Fusobacterium develop 
more tumours than mice not exposed 
to the bacterium. 

Not all species of bacteria have this 
carcinogenic effect, however. Jobin’s team 
has shown that immune-compromised mice 
colonized with either E. coli or Enterococcus 
faecalis (another common gut species) develop 
colon inflammation, but only the mice that 
receive E. coli develop tumours. Inflamma- 
tion is clearly important: transfer E. coli into a 
mouse that cannot produce an inflammatory 
response, and no tumours appear. It seems 
that an inflammatory environment in the 
gut changes the pattern of E. coli gene activ- 
ity, Jobin says. To confirm this hypothesis, 
he has shown that deleting these inflamma- 
tion-induced genes from E. coli removes the 
microbe’s ability to cause cancer. The carcinogenic
effects of B. fragilis also depend on
inflammation, according to Cynthia Sears 
and her colleagues at Johns Hopkins Univer- 
sity in Baltimore, Maryland, who showed that 
the bacterium does not produce intestinal 
tumours when interleukin (IL)-17, an inflammatory
molecule, is blocked. Sears says that
IL-17 is important in tumorigenesis in a variety 
of other body sites as well. 

These animal studies paint a clear picture. 
“All this tells us that it’s the microbiome,” says
Schloss. “And also importantly, it tells us that
it’s not just the microbiome.”


HUMAN VARIATION 

Demonstrating that similar processes occur in 
humans has been problematic, as illustrated 
by Schloss’s study that found fewer tumours, 
not more, in mice that received microbiomes 
from human colorectal-cancer patients. But
transferring microbiomes between species is
an unnatural process and might be expected 
to throw up murky results. 

Even so, there is some evidence that links 
Fusobacterium, B. fragilis and E. coli to colo- 
rectal cancer in humans. For example, Fuso- 
bacterium is more abundant in precancerous 
adenomas than in other sites in the colon. And 
the gene encoding B. fragilis toxin is present 
in the colons of 90% of people with colorectal 
cancer but in about half of the colons of healthy 
people. Sears points out that this is similar to
observations for stomach cancer, which is associated
with the bacterium Helicobacter pylori.

Transmission electron micrograph of bacteriophages, which could
be used to kill strains of bacteria implicated in colorectal cancer.

But it would be an oversimplification to 
think of the microbiome’s role in colorec- 
tal cancer primarily in terms of a patho- 
genic infection by one or two troublesome 
microbes. “There are cases where individual 
bacteria seem to be associated with tumours,” 
Schloss says. But far more often, he says, there 
is a “community effect” — a more generalized 
dysbiosis that may enable an individual cancer- 
promoting bacterium to gain a foothold, or to 
alter the microbiome’s overall function in a way 
that leads to tumour formation. 

Dysbiosis can have many different causes, 
but a common one is diet. High-fat Western 
diets have been linked to the development of 
colorectal cancer. But studies into the role of diet
and fibre intake in colorectal cancer have
yielded inconsistent results. Some researchers 
say this is because these studies have not taken 
into account people's different microbiota. 

For example, individuals differ in their 
microbiome’s capacity to produce butyrate, a 
fatty acid made by certain microbes when they 
break down dietary fibre. Butyrate is impor- 
tant for the health of the intestinal epithelium, 
which is one of the body’s most rapidly renew- 
ing tissues — Umar points out that all the cells 
in the epithelium are replaced every 3–5 days.

Butyrate fuels this energy-intensive process 
and acts as a potent anti-inflammatory agent. 
It also feeds into important biochemical path- 
ways, such as the Wnt and Notch pathways, 
which regulate epithelial-cell proliferation 
and have been implicated in colorectal cancer.
“You need very controlled and regulated activ- 
ity of these pathways,” Umar says. Changes in 
microbial products — either the absence of 
butyrate or the presence of other metabolites 
— can affect these pathways and increase the 
proliferation of gut epithelial cells, a key step 
towards cancer. 

Researchers should also consider the genetic 
background of individuals who have a partic- 
ular community of microbes. Studies of mice 
with a genetic defect linked to a familial colo- 
rectal-cancer syndrome suggest that 
in individuals with this genetic back- 
ground, butyrate can actually promote 
the development of colorectal cancer. 

About 20% of colorectal cancers have 
similar genetic defects, which affect 
DNA-mismatch repair. This means that
it is crucial to carefully select the patients 
or subjects used in studies if we are to 
understand the effects of the microbi- 
ome on colorectal cancer, says Martin, 
who led the mismatch-repair work. 

Unravelling all these effects is only
the first step to understanding what
causes colorectal cancer. “What we 
are after is not so much the nature of 
this dysbiosis, but whether we can do 
something about it,” Jobin says. Some 
scientists have tentatively wondered 
about using antibiotics to knock out cancer- 
promoting gut microbes but worry about the 
potential effects on the microbiome as a whole. 
Jobin has a different idea: develop bacterium- 
killing viruses known as bacteriophages to 
target specific strains of bacteria that are 
implicated in cancer while leaving the rest of 
the microbiome to rumble along as usual. 

Other scientists suggest that diet or pro- 
biotics (microorganisms thought to provide 
health benefits) could potentially manipulate 
the composition of the microbiome. How- 
ever, this strategy needs a lot of work because 
researchers have had difficulty using probiot- 
ics to induce stable, long-term changes in the 
microbiome. 

But microbiome interventions have shown 
promise for treating other gastrointestinal con- 
ditions. For instance, persistent infection with 
the bacterium Clostridium difficile responds to 
the unlikely intervention of faecal transplanta- 
tion. Such examples offer hope that one day 
a microbial solution might also be found for 
colorectal cancer.


Sarah DeWeerdt is a freelance science writer 
based in Seattle, Washington. 
Faecal samples contain cells that can be tested for genetic mutations associated with colorectal cancer.


DRUG DEVELOPMENT 


Mix and match 


Doctors face a maze of drug options and genetic markers
to find the right treatment for people with advanced
colorectal cancer.


BY MEGAN SCUDELLARI 


First, the good news: since 2004, the
arsenal of approved drugs to fight
colorectal cancer has more than doubled.
Five targeted therapies newly approved
by the US Food and Drug Administration 
(FDA) have joined four established chemo- 
therapeutic agents that were already on phar- 
macy shelves. Together, these drugs have 
extended average overall survival for advanced 
colorectal cancer from 15 months to 30 months.

And the bad news? That figure for average 
survival has not climbed above 30 months. The 
new drugs have been of “marginal, incremental 
benefit”, says Scott Kopetz, a colorectal-cancer 
specialist at the University of Texas MD
Anderson Cancer Center in Houston.
“They are not home runs.” And a home
run is badly needed, because colorectal cancer
is the fourth leading cause of cancer-related
death globally (third in the United States). 

Surgery is used when colorectal cancer is 
diagnosed early and achieves an impressive 
90% cure rate. But the other 10% of patients, 
and those for whom the disease was not 
detected early, develop advanced disease, 
marked by metastatic spread of tumour cells 
to the lymph nodes and other sites in the body. 
The 5-year survival rate plummets to just 11% 
for those patients whose disease has spread to 
distant organs. 

Late-stage colorectal cancer is treated with 
drugs, usually a combination of general chemotherapy
and a molecularly targeted therapy,
a drug that interferes with a specific gene or
molecule in a cancer cell. “In stage-4 disease, 
it takes an all-hands-on-deck approach,” says 
Adam Snook, who develops treatments for 
colorectal cancer at Thomas Jefferson Uni- 
versity in Philadelphia, Pennsylvania. 

Targeted therapies give doctors more 
options for treating advanced colorectal can- 
cer, but that very pool of choices provides 
the field’s current challenge — figuring out 
which drugs to give to which patients. No two 
tumours are the same, and patients can have 
very different responses to the same drug, so 
clinicians find themselves involved in an elabo- 
rate guessing game to keep their patients alive. 
If scientists were able to link tumour charac- 
teristics, such as DNA mutations and gene- 
expression profiles, to positive drug responses, 
that would lead to diagnostic tests to select the 
best treatment for a given patient. 

“We can now target virtually any kind of 
pathway and have the drugs available,” says 
Thomas Seufferlein, director of internal medi- 
cine at Ulm University in Germany, who runs 
clinical trials for colorectal-cancer therapies. 
“Now we really need to match the treatment 
to the tumour.” 

Researchers are taking a three-pronged 
approach to try to overcome the 30-month
plateau for colorectal-cancer survival. First, 
they are testing various combinations of 
approved therapies in clinical trials. Second, 
the recent identification of four molecular 
subtypes of colorectal cancer means that 
new biomarkers might help to identify which 
drugs are appropriate for individual patients. 
Finally, the effort to discover new drugs is con- 
tinuing, with strategies that include targeting 
biochemical pathways in tumours that have 
so far resisted attack, as well as activating the 
immune system against the cancer. 


DRUG ROULETTE 

The first drug proven to work against colorec- 
tal cancer — 5-fluorouracil (5-FU) — arrived 
on the scene in 1962. For decades, it was the 
only effective colorectal-cancer drug. The late 
1990s and early 2000s saw the development of 
three other chemotherapeutic agents, oxali- 
platin, capecitabine and irinotecan, which all 
inhibit DNA synthesis and stop cancer cells
dividing (see ‘Drug options’). When com- 
bined with 5-FU, these drugs made up the 
first chemotherapy combinations for colorec- 
tal cancer, with names that read like a strange 
game of Scrabble: FOLFOX, FOLFIRI and 
FOLFOXIRI. 

Targeted therapies first appeared in 2004, 
starting with the monoclonal antibody beva- 
cizumab, which inhibits the growth of new 
blood vessels. Bevacizumab was followed by 
a string of kinase inhibitors, which block the 
chemical messages that encourage cell divi- 
sion. “Each agent on its own has maybe not 
been that impressive, but combinations have 
greatly improved survival,” says Snook.

“But,” he quickly adds, “there’s still a long
way to go.” That’s because no single combina- 
tion has successfully treated the wide diversity 
of tumours. DNA and RNA sequencing of can- 
cer-cell genomes reveals that colorectal cancer 
is not a single disease, but rather many types 
of cancer that are instigated and propelled by 
different mutations in different people. There 
is no one-drug-fits-all solution. For now, com- 
bination therapy is the best option. 

Part of the reason why combinations are 
necessary is the sheer amount of redundancy 
of cancerous pathways. Colorectal cancers, like 
many other cancers, can compensate for loss. 
If you target one pathway with an inhibitor, 
the tumour will become resistant by mutating 
and upregulating another pathway to per- 
form a similar function 
example, found that
general chemotherapy 
followed by a combination of two targeted 
therapies caused serious side effects in more 
than half of the patients, including bowel 
bleeding and holes in the walls of the intestine, 
with no difference in overall survival.

So clinicians walk a fine line between trying 
to prevent tumour resistance and avoiding tox- 
icity. The best way to navigate that line in the 
future, most agree, is to identify which patients 
will respond best to which combinations. That 
ability requires some sort of biological marker, 
typically a genetic mutation or a specific pat- 
tern of gene expression. 

But colorectal cancer is too complex to 
have just one biomarker — researchers expect 
to find dozens. So far, though, only three bio- 
markers have been identified for advanced- 
colorectal-cancer drug treatment, and all are 
negative biomarkers, which means they indi- 
cate that a patient will not respond to a particular
therapy. Patients with mutated KRAS,
NRAS or BRAF genes do not respond to two of
the available targeted therapies, panitumumab
and cetuximab.

And that is the whole list. No other bio-
markers exist to guide care for patients with
advanced colorectal cancer.

Drug options
After decades of stagnation, the US Food and Drug Administration has approved several agents for
metastatic colorectal cancer in recent years. The table shows all approved agents as of December 2013.

Class | Agent | Year | Characteristics
Cytotoxic chemotherapy | 5-fluorouracil | 1962 | Inhibits the action of an enzyme that synthesizes a nucleoside required for DNA replication; listed as one of the World Health Organization’s (WHO’s) ‘essential medicines’.
Cytotoxic chemotherapy | Capecitabine | 2001 | This ‘prodrug’ is administered in an inactive form; enzymes in the body convert it to 5-fluorouracil.
Cytotoxic chemotherapy | Irinotecan | 2000 | Derived from the Asian ‘happy tree’, a deciduous tree native to southern China.
Cytotoxic chemotherapy | Oxaliplatin | 2004 | A platinum-based drug that causes DNA to crosslink, preventing DNA synthesis.
Inhibits growth of new blood vessels | Bevacizumab | 2004 | A monoclonal antibody used to treat a variety of cancers; one of the WHO’s essential medicines.
Inhibits growth of new blood vessels | Aflibercept | 2012 | First approved for the treatment of macular degeneration.
EGFR antibody | Cetuximab | 2004 | An antibody that binds the EGFR receptor; it does not work if a downstream protein of EGFR, called KRAS, is mutated.
EGFR antibody | Panitumumab | 2006 | Very similar in activity to cetuximab; a 2014 comparison found the two drugs to have similar overall survival benefit and toxicity.
Kinase inhibitor | Regorafenib | 2013 | The most recently approved drug for colorectal cancer, it is believed to inhibit at least eight different kinases.

Source: Sridharan, M. et al. Oncology 28, 110–118 (2014).


FINDING SUBTYPES 

Oncologists would love to find positive bio- 
markers, indicating that a patient will benefit 
from a particular therapy. If they could use 
molecular information to predict positive 
responses to treatment, they could turn the 
chaotic landscape of colorectal-cancer treatment
into a smooth flowchart of expected outcomes.
And they have a great deal of molecular
information, thanks to the availability of next- 
generation sequencing technologies. 

“It’s like we stepped out of the age where you
judge the weather by watching the birds,” says 
Seufferlein. “This is what we did for a long time 
— judge the tumour by looking at X-rays or 
CT scans. Now, we just need to translate the 
molecular data into therapeutic strategies.” 

One of the first steps towards making this 
translation is to better understand the diver- 
sity of the tumours. That is why, three years 
ago, researchers around the globe began cat- 
egorizing colorectal cancer into distinct sub- 
types based on gene-expression patterns. The 
first results came from The Cancer Genome 
Atlas Network in 2012, based on the analysis 
of 276 human colorectal-cancer tumours.
The study proposed three classes of colorec- 
tal-cancer tumour. Other teams subsequently 
proposed two, four, five, even six tumour 
subdivisions. 

The jockeying to find the best classification 
was reaching a frenzy when, in a remarkable 
display of collaboration, more than 15 competing
institutions joined forces to agree on
the number of subtypes. Kopetz remembers
being sceptical of the idea that so many different
groups, which had previously reached competing
conclusions, might find consensus. But
they did.

Organized by Sage Bionetworks, a non- 
profit company in Seattle, Washington, that 
specializes in collaborative data analysis, the 
Colorectal Cancer Subtyping Consortium 
used data from more than 5,000 tumour 
samples to identify four molecular subtypes’. 
“Everyone is setting aside their own classifica- 
tions and saying this unified classification is 
what we should all use to move forward,” says 
Kopetz. “It is tremendous.” 

So far, the consortium has identified some 
subtypes in which particular drug combina- 
tions clearly do not work, but many of their 
results are yet to be published. The goals now 
are to identify the best approved treatments 
for each subtype, and to use the subtypes to 
stratify patients in clinical trials. 


DRUGGING THE ‘UNDRUGGABLE’ 
There is still a huge need for new drugs to treat 
colorectal cancer, particularly therapies that 
are more effective and have fewer side effects 
than the current ones. But much of the phar- 
maceutical pipeline is made up of drugs that 
work in the same way as existing therapies. 
“There are a number of ‘me-too’ drugs, but 
we’re not really tackling the root of colorectal 
cancer,” says Federica Di Nicolantonio of the 
University of Turin in Italy, who specializes in 
preclinical drug research for colorectal cancer. 
That root may be one, or both, of two types of 
mutation that pop up in every colorectal-cancer 
screen: RAS genes and the Wnt pathway. Nei- 
ther has been successfully targeted with a drug, 
earning them the label ‘undruggable’. 

[Figure: Immunotherapy has eliminated colorectal-cancer metastases from the lungs in mice (outlined area). Control mouse shows lungs with tumour present; mouse treated with a therapeutic vaccine shows tumour-free lungs.] 

RAS mutations drive one-third of all human 
cancers, including 45% of colorectal cancers, by 
causing uncontrollable cell growth. For more 
than 30 years, scientists have tried to target the 
three human RAS genes (KRAS, HRAS and 
NRAS) but they keep hitting a brick wall. The 
RAS enzymes lack an active site that would nor- 
mally be the target for a small-molecule drug, 
says Frank McCormick, who works on RAS 
at the University of California, San Francisco. 
And the rest of the protein’s surface has no deep 
pockets that could make alternative targets. “It’s 
just not a traditional enzyme,” he says. 

Because RAS mutations play such a large 
part in so many cancers, in 2013 the US 
National Cancer Institute began the RAS 
Initiative, a collaborative effort to better under- 
stand how RAS mutations drive cancer and to 
find new ways to silence the proteins. “We're 
trying to bring RAS back into focus as a drug 
target,” says McCormick. “New technologies 
being developed are enabling us to target the 
protein in ways that were not possible when it 
was first attacked 15-20 years ago.” 

Academic teams are making progress with 
two small molecules that use chemical tricks 
to block RAS function. Both molecules are 
actively moving towards clinical trials in part- 
nership with biopharmaceutical companies, 
says McCormick. 




S14 | NATURE | VOL 521 | 14 MAY 2015 


Less progress is being made in attempts to 
drug the Wnt pathway, a cascade of proteins 
that pass signals from the outside of a cell to 
the inside. This pathway is normally inactive 
in healthy adult cells but is activated by muta- 
tions in cancer cells. A 2012 analysis by the 
Cancer Genome Atlas Network found the Wnt 
pathway to be altered in 93% of all colorectal 
tumours’. Yet, like RAS proteins, the pathway 
has evaded all attempts to silence it. 

A few Wnt inhibitors are now in early clini- 
cal trials, however. A team at the Icahn School 
of Medicine at Mount Sinai in New York, for 
example, is testing the addition of a soya- 
derived compound called genistein, which 
seems to inhibit a protein in the Wnt pathway, 
to a standard chemotherapy regimen in a phase 
I/II clinical trial. And drugmaker Novartis 
Pharmaceuticals is conducting a phase I trial of 
another potential small-molecule Wnt inhibi- 
tor. So far, there have been no late-stage clinical 
trials for Wnt inhibitors in colorectal cancer. 


ACHIEVING IMMUNITY 

As well as sending in molecules to kill tumour 
cells, scientists are also trying to activate the 
immune system to fight cancer cells. Immu- 
notherapy has shown great promise in the 
treatment of other cancers, but has so far been 
unsuccessful against colorectal cancer. 

One immunotherapy approach that showed 
remarkable results when treating melanoma 
and leukaemia, for example, proved disas- 
trous against colorectal cancer. This ‘adoptive 
cell therapy’ technique involves removing 
immune T cells from a patient’s blood, mul- 
tiplying them in a dish (often after genetically 
engineering the cells to make them specifically 
attack tumour cells), and injecting them back 
into the patient. In a phase I trial that tested 
the procedure on three people with advanced 
colorectal cancer, all three experienced severe 
inflammation of the colon and had only 
temporary reductions in their tumours’. In 
another case, a person with advanced colorec- 
tal cancer treated with engineered T cells had 
a severe immune reaction and died’. 

Another type of immunotherapy deploys 
antibodies to shut down molecules that nor- 
mally keep the immune system in check, 
unleashing the immune system to attack a 
tumour. But these ‘checkpoint inhibitor’ treat- 
ments have failed in colorectal cancer; in some 
studies, not one patient responded. “It’s been a 
major disappointment,” says Di Nicolantonio. 

But not everyone has given up hope of using 
immunotherapy to treat colorectal cancer. 
Maybe drug developers have been using the 
wrong antigens — proteins on the surface of 
tumour cells that activate the immune system 
— says Snook, who is leading a clinical trial for 
a therapeutic vaccine. 

Two antigens, CEA and MUC1, have 
been widely studied in colorectal cancer and 
tested in therapeutic vaccines for more than 
30 years, with mixed results. Snook’s team 
recently turned to an antigen called GCC, 
which is found in 95% of colorectal-cancer 
tumours. The researchers are testing the safety 
and activity of a GCC therapeutic vaccine, and 
Snook hopes to have results by autumn 2015. 

Immunotherapy may work on its own, or 
it could be combined with other treatments. 
Snook’s team recently showed that radiation 
therapy followed by a therapeutic vaccine sig- 
nificantly amplified the immune response and 
shrank tumours in a mouse model of colorectal 
cancer’. “Neither therapy alone was particu- 
larly effective, but when we combined the two 
together, we saw a great benefit,” says Snook. 

Perhaps the future of colorectal-cancer ther- 
apy lies not only in combinations of drugs, but 
in combinations of different treatment types. 

Until then, researchers continue to try to 
incorporate the newly defined colorectal-can- 
cer subtypes and other biomarkers into clinical 
trials to improve the personalized treatment 
of advanced colorectal cancer. “We have all 
the molecular data available,” says Seufferlein. 
“Now we need to translate it into therapeutic 
strategies.” 


Megan Scudellari is a freelance science writer 
based in Boston, Massachusetts. 
Q&A Hans Clevers 
Banking on organoids 


In 2009, Hans Clevers and Toshiro Sato (then a postdoc in Clevers’ lab) demonstrated a powerful 
new model to study development and disease: a three-dimensional ‘organoid’ derived from adult 
stem cells that replicates the structure of cells lining the intestine. More than 100 labs worldwide 
are now working with different types of organoid to study cancer and other diseases. Clevers, at 
the Hubrecht Institute in Utrecht, the Netherlands, discusses the potential of this approach. 


Why might it be better to screen drugs in 
organoids rather than in cell lines? 

We don't currently understand why certain 
tumours are sensitive or resistant to particular 
drugs. With targeted therapies, you can make 
a prediction, but for classical chemotherapy 
drugs, such as cisplatin or 5-fluorouracil, it 
is totally unpredictable which tumours will 
respond. Tumours can be sequenced in great 
detail, but drugs against them cannot be tested 
effectively other than in clinical trials. Orga- 
noids are a very good genetic representation 
of the tumour, so they let us bridge the gap 
between deep-sequencing efforts and patient 
outcomes. 


How do you see organoids contributing to the 
study of colorectal cancer? 

We are collaborating with groups at the Broad 
Institute in Cambridge, Massachusetts, and 
the Sanger Institute in Hinxton, UK, to build a 
biobank of organoids from 20 or so people with 
colon cancer. We have organoids of the cancer 
and of normal cells from individual patients, as 
well as sequences of their protein-coding genes. 
We have established the non-profit Hubrecht 
Organoid Technology (HUB) to expand our 
organoid biobanks. The HUB shares these 
biobanks with academic groups around the 
world, and now works with about 15 compa- 
nies on drug-development programmes. We 
can culture tumours from almost every per- 
son with colon cancer, sequence them and test 
them against drugs. Additionally, we can use 
research techniques that have been developed 
for cell lines, such as genetic tools, fluores- 
cence-activated cell sorting and microarrays. 


Is this research moving towards clinical trials? 

Yes, my group and the HUB are collaborating 
with Emile Voest at the Netherlands Cancer 
Institute in Amsterdam on an observational 
trial. We already have some organoid models 
from people with colon cancer who receive 
chemotherapy. The organoids are screened 
against a panel of common colon-cancer 
drugs. The patients will be treated the same 
way the oncologists would normally treat 
them, but we'll see if we could have predicted 
the response from our organoids. We're also 
starting another trial in which we will enrol 
advanced-colon-cancer patients, for whom 
there is no standard treatment. We will make 
organoids, test drug sensitivity and resistance, 
and then advise the oncologists as to what drug 
to use for that particular patient. We will be 
looking at multiple drugs, so we need large 
numbers of patients — that’s the only way we 
will be able to produce enough data to help us 
match drugs to tumour types. 


To benefit individual patients, won’t you need 
to test the drugs very quickly? 

Yes — and that’s really where we want to take 
this technology. When you have pneumonia, 
your bacterial cultures are tested and you get 
answers in three days. With this technology, 
we can tell the oncologist the best odds for a 
combination of therapeutics, maybe not in 
three days, but in several weeks. We have an 
organoid-based test in cystic fibrosis that gives 
us a result in about two weeks. 


How does the organoid approach differ from 
patient-derived xenografts, in which patients’ 
tumours are transplanted into immune- 
suppressed mice for testing drugs? 

It's the same principle — you get a functional 
readout of the patient’s tumour. But organoids 
can be tested against an unlimited number of 
compounds and combinations. Furthermore, 
in contrast to xenografts, organoids can be 
established from almost all patients. 


What are some of the next steps in your 
cancer research? 

Organoids model the key component of the 
tumour but they lack some important ele- 
ments. We want to combine organoids with 
other elements to make more-complete tools. 
For instance, we would like to introduce the 
immune system so that we can study the effects 
of the fantastic new immunotherapy drugs. We 
think that we can build it up in a reduction- 
ist way — take lymphocytes isolated from 
a tumour, bring these together with cancer 
organoids derived from the same tumour and 
watch what happens. And maybe we can also 
put microorganisms in these organoids. For 
example, we could add Helicobacter, a major 
cause of stomach cancer, to stomach organoids. 


Can organoids also help to test drug 
combinations? 

Yes, tumours are genetically heterogeneous, 
and there can be vast differences in drug sensi- 
tivity between clones of the same tumour. We 
can possibly advance sequence-based therapy 
by testing millions of drug combinations in 
organoids. 


INTERVIEW BY ERIC BENDER 
5 BIG QUESTIONS 

Research is attacking colorectal cancer on many fronts, with varying degrees of success. But solving these five central puzzles is likely to be crucial. 

BY SHRADDHA CHAKRADHAR 


QUESTION: What drives the mutations that cause polyp formations in non-hereditary cancers? 
WHY IT IS HARD TO ANSWER: As with other multifactorial conditions, there are too many moving parts to make it easy to pick out specific culprits. 
CURRENT UNDERSTANDING: Some 5-10% of colorectal cancers are caused by heritable mutations. Scientists think that about 50% of the rest are caused by sporadic mutations. 
“We know that mutations cause polyps, but we don’t know what proportion of mutations are caused by replicative mutation and by environmental factors.” Bert Vogelstein, Johns Hopkins. 


QUESTION: What determines whether a polyp will become cancerous? 
WHY IT IS HARD TO ANSWER: Researchers must tease out the sequence of mutation events that drives this transformation. 
CURRENT UNDERSTANDING: Larger polyps are more likely to become cancerous. Adenomas, one of the main types of polyp, lead to 95% of all colorectal cancers, but fewer than 10% of adenomas become cancerous. 
“A lot of the information we have is from [people with] genetic abnormalities, and that only helps us infer what happens in normal people.” Neil Hyman, University of Chicago Medical School. 


QUESTION: What environmental factors contribute to the progression of disease? 
WHY IT IS HARD TO ANSWER: Isolating crucial factors and combinations is difficult. Clinical trials take many years. 
CURRENT UNDERSTANDING: Sedentary lifestyles and diets that are high in fat and red meat are associated with an increased risk. 
“All the empirical studies that have tried to determine what about the diet causes cancer haven’t really been that successful.” Heidi Nelson, Mayo Clinic. 


QUESTION: What makes some cancers more responsive to certain therapies than others? 
WHY IT IS HARD TO ANSWER: Colorectal cancer shows a high degree of heterogeneity in mutations, both between and within individual tumours. 
CURRENT UNDERSTANDING: Precise molecular profiles of tumours will help to predict the response to targeted therapies or combinations. 
“We’re throwing the dice; we really don’t know what’s going to work and what isn’t.” Vogelstein. 


QUESTION: How can immunotherapy be harnessed to treat colorectal cancers? 
WHY IT IS HARD TO ANSWER: We lack good ways of studying colorectal cancer in humans, and animal models (mostly mice) are inadequate. Some immunotherapies are in trials, but any clinical impact is years away. 
CURRENT UNDERSTANDING: Some tumours have infiltrating lymphocytes, which are white blood cells that can aid in killing the tumour. 
“Every patient who shows up at my door wants to know if we can activate their immune system to kill cancer.” Charles Fuchs, Dana-Farber Cancer Institute. 


Shraddha Chakradhar is a news editor at Nature Medicine. 